/g/ - Technology


Thread archived.
You cannot reply anymore.




File: the longest dick general.jpg (3.62 MB, 3264x1534)
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102744592

Chink Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://replicate.com/black-forest-labs/flux-1.1-pro
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DiT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
Blessed thread of frenship
>>
>>102764387
What's bigma?
>>
Quit hallucinating.
>>
File: 1705772579350143.png (1.26 MB, 1024x1024)
>>
>>102764413
pixart bigma
>>
https://github.com/AIFSH/PyramidFlow-ComfyUI?tab=readme-ov-file
How much VRAM does it ask for?
>>
>>102764413
i dunno wassa bigma with you?
>>
>>102764575
>>102760652
>https://github.com/jy0205/Pyramid-Flow/issues/12#issuecomment-2404752801
>>The 384p version requires around 26GB of memory, and the 768p version requires around 40GB (we do not have the exact number because of the cache mechanism on the 80GB GPU)
>>
>>102764387
>/ldg/ returning to its chang roots
nature is healing
>>
File: 0.jpg (269 KB, 832x1216)
>>
AMD unveils new AI chips to compete with Nvidia.
>>
File: ComfyUI_temp_uyzyp_00040_.png (2.23 MB, 1072x1880)
>>
>>102764727
we're more likely to see a completely new AI company making hardware from China than AMD seriously competing in AI
>>
>>102764727
it's useless, they'll always be below Nvidia because of CUDA
>>
>>102764768
Make the chips compatible with CUDA. Simple. Right?
>>
>>102764785
they've been trying that for more than 5 years; they got somewhere but it's still not close
https://github.com/vosen/ZLUDA
>>
File: file.png (3.58 MB, 1287x1788)
Babe wake up, they improved SDXL
https://huggingface.co/comin/IterComp
https://civitai.com/models/840857/itercomp
>>
>>102764930
>SDXL
I sleep. Why are people wasting so much money on an objectively bad architecture?
>>
>>102764595
GGUF when?
>>
>>102765034
Ikr, today we got that video model that uses SD3 (to be fair they said they're retraining everything from scratch) and now this IterComp for SDXL; it's Flux that needs love, not deprecated models
>>
>>102765061
I wouldn't say no to someone retraining SD3 or just making a 3B model.
>>
File: 1712031233757673.png (1.08 MB, 896x1152)
>>
https://github.com/jy0205/Pyramid-Flow
https://huggingface.co/spaces/Pyramid-Flow/pyramid-flow
there's a demo now
>>
File: 1717434688440756.png (1.6 MB, 896x1152)
>>
File: 1724849575722123.png (752 KB, 896x1152)
>>
File: 1700165491476736.png (757 KB, 896x1152)
>>
File: file.webm (368 KB, 1280x768)
>>102765147
If you go for 24fps you'll only get 1 sec lol
>>
File: fs_0082.jpg (66 KB, 920x920)
>>102765147
>>102765236
tried a few times to get this to look at the camera but it just sorta wiggled like yours each time :/
oh well not paying gpu minutes to try more, will wait for it to run in under 24gb
>>
File: file.webm (741 KB, 1280x768)
>>102765236
>>102765147
went for 8 fps and... kek
>>
>>102765061
wow, surprise, turns out all the people who know what they're doing came to the conclusion that flux is rigid, overhyped, and not worth the training costs. it's simply not a 12b-tier model. bloated with synthetic garbage and still requires sdxl refiner to unslop. bake again
>>
File: file.png (78 KB, 2230x498)
>>102765279
>will wait for it to run in under 24gb
this model will be history anyway, they're retraining it from scratch to get the best model possible
https://github.com/jy0205/Pyramid-Flow
>>
bigma will save us
>>
>>102765304
>turns out all the people who know what they're doing came to the conclusion that flux is rigid, overhyped, and not worth the training costs.
and so for you going for the most broken base model ever (SD3M) was a good idea? get the fuck out of here
>>
>>102765304
>flux is rigid, overhyped, and not worth the training costs. it's simply not a 12b-tier model. bloated with synthetic garbage
All the CFG antiburners are cope too
>>
>>102765339
>All the CFG antiburners are cope too
good thing we don't need any CFG antiburners anymore with the undistilled models
https://huggingface.co/nyanko7/flux-dev-de-distill
https://huggingface.co/ashen0209/Flux-Dev2Pro
>>
>>102765279
what is putting this thing so high VRAM? The models seems all under 10GB
>>
>>102765305
Wow, SD3 is so shit that even the CCP rejects this shit.
>>
File deleted.
>>
>>102765370
because making pictures takes VRAM, anon, and 24fps + 10 sec means 240 pictures that have to be rendered at the same time; it's like running a 240 batch size on SD models
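The batch-size analogy above can be put in numbers. A back-of-envelope sketch of why video gen VRAM scales with frame count: every frame's latent is alive at once, like a giant batch. The numbers here are illustrative assumptions (16-channel latent, 8x VAE downscale, fp16), not Pyramid Flow's real figures, and the attention activations on top of the latents are what actually hit the quoted 26-40GB.

```python
# Estimate the latent-tensor memory for a clip; all parameters are
# assumed, illustrative values, not measured Pyramid Flow numbers.
def latent_gb(frames, height, width, channels=16, down=8, bytes_per=2):
    vals = frames * channels * (height // down) * (width // down)
    return vals * bytes_per / 1024**3

one = latent_gb(1, 768, 1280)
clip = latent_gb(24 * 10, 768, 1280)   # 24 fps x 10 s = 240 frames
print(f"single frame {one:.4f} GB, 10s clip {clip:.2f} GB ({clip/one:.0f}x)")
```

The latents themselves are small; the point is the 240x multiplier that every per-frame buffer and activation inherits.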
>>
>>102765365
once someone makes a killer full real fintune then ill be interested
>>
>>102765083
>I wouldn't say no to someone retraining SD3 or just making a 3B model.
I would take retrained 1.5 at this point. V-prediction if possible
>>
>>102765481
Just train Pixart Sigma then.
>>
>>102765481
> retrained 1.5
why? unet is definitely inferior to a DiT architecture

>V-prediction
what's that?
>>
>>102765402
gotcha. I have two cards and have been looking for a way to split the VRAM requirements, but I'm not seeing anywhere that it's supported. Models on one card and processing on the other seems like it could get you to 26GB pretty easily; moving the two larger text encoders alone would be enough to drop it below 24GB.
>>
>>102765537
>The larger two text encoders are enough to drop it below 24GB.
what text encoders are they using? T5?
>>
>>102765548
I have no idea. There are just folders named text_encoder_1, text_encoder_2 and text_encoder_3. I don't see them used in the code either so I am not sure what is going on. I assume you need them, but I haven't dug that far.

Hopefully another anon will know.
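The two-card split described above can be sketched as a placement problem: put the big DiT on one GPU and the text encoders plus VAE on the other. The component sizes (GB) below are illustrative guesses, and `plan_placement` is a made-up helper, not Pyramid Flow's API.

```python
def plan_placement(components, budgets):
    """Greedy: assign the largest component first, each time onto the
    device with the most free memory. components/budgets are in GB."""
    free = dict(budgets)
    plan = {dev: [] for dev in budgets}
    for name, size in sorted(components.items(), key=lambda kv: -kv[1]):
        dev = max(free, key=free.get)          # device with most free GB
        if free[dev] < size:
            raise RuntimeError(f"{name} ({size} GB) fits nowhere")
        free[dev] -= size
        plan[dev].append(name)
    return plan

components = {"dit": 12.0, "text_encoder_1": 0.5, "text_encoder_2": 1.5,
              "text_encoder_3": 9.0, "vae": 0.3}
placement = plan_placement(components, {"cuda:0": 16.0, "cuda:1": 16.0})
print(placement)
```

With these guessed sizes the DiT lands alone on one 16GB card and everything else fits on the second, which is the split the anon is hoping for.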
>>
File: file.webm (845 KB, 640x384)
>>102765147
>based on SD3M
yeah I can see that
>>
File: mrnu3fzcavtd1.jpg (154 KB, 1178x706)
better start saving up bros
>>
>>102765642
who the fuck is gonna buy the 5080 and the 5070? Do they pretend the 3090 and the 4090 don't exist?
>>
Is there any video model that doesn't do the thing where when you give it a painting, it just kind of does a Ken Burns slow panning effect on it instead of animating it?
>>
>>102765680
Minimax actually animates shit but it's not a local model so...
>>
>>102765673
aren't they discontinuing the 4090 already?
>>
>>102765733
why would they keep manufacturing 4090s?
>>
>>102765746
easy money
>>
>>102765781
you clearly have never run a business
what happens when they release the 5090?
the factory has capacity limits you know?
why would they sell new 4090s and 5090s side by side?
can you do a business plan that doesn't involve you, as a greedy poorfag, getting a new 4090 for $1000?
>>
>>102765805
what are you on about?
>>
File: meh.webm (577 KB, 1280x768)
Pyramid 8 fps img2vid: A middle aged female scientist watches a fantastic machine that spins and whirrs with sparks until a piece of fried chicken falls out from the glowing blue middle of the machine

Seems you get 1 gen then have to wait.
>>
>>102765833
I personally want to win a billion dollars
>>
File: meh-interp.webm (1.34 MB, 1280x768)
>>102765842
not bad with interpolation
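Frame interpolation like the gen above can be sketched in its most naive form: turn an 8 fps clip into 24 fps by inserting blended frames between each source pair. Real interpolators (RIFE, FILM, etc.) estimate motion instead of linearly blending; this toy uses tiny lists of floats as stand-in "frames".

```python
def lerp(a, b, t):
    """Linear blend between two flat frames at fraction t."""
    return [pa * (1 - t) + pb * t for pa, pb in zip(a, b)]

def interpolate(frames, factor=3):
    """Insert factor-1 blended frames between each consecutive pair."""
    out = []
    for cur, nxt in zip(frames, frames[1:]):
        out.append(cur)
        for k in range(1, factor):
            out.append(lerp(cur, nxt, k / factor))
    out.append(frames[-1])
    return out

clip = [[0.0, 0.0], [3.0, 3.0], [6.0, 6.0]]   # 3 tiny "frames"
smooth = interpolate(clip)                     # 3 frames -> 7 at 3x
print(len(smooth))
```

A 3x factor is exactly the 8 fps to 24 fps jump being discussed.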
>>
>>102764387
>pic
>authors: ching chong ping pong suk mai ding dong
dropped
>>
>>102765885
like it or not anon, but the chinks are the kings of video models, Kling, Minimax, CogVideo, Pyramid...
>>
>>102765781
that's what the 5080/5070 is for
>>
>>102765885
might as well drop this entire hobby then lol
>>
do you think any of them are cute chinese girls?
>>
>>102765949
i like to imagine the sweat and juices of many underpaid chinese jade beauties that touched my nvidia gpu during production
>>
which version of pytorch should i use with comfy? i remember seeing a comparison image that showed some versions are better than others but forgot which
or is it all just placebo?
>>
>>102764930
Been testing this. Seems pretty decent.
>>
>>102765895
It's because boob jiggle triggers all the safety teams
>>
>>102765877
I'd duplicate the space and try native 24fps but im not paying for it.
>>
File: IMG_7785.png (3.18 MB, 1248x1824)
>>102766001
do you happen to have any examples that arent super sloppafied like picrel
>>
>>102766044
we can finally move on from flux
>>
>>102765323
yeah and they'd rather train their own model from scratch than use flux
>>
>>102766115
that's because flux is too big, their 3b model already asks for fucking 40gb of vram
>>
>>102766001
>Been testing this. Seems pretty decent.
care to show some examples
>>
>Most of the sample pictures on all loras are done with controlnet/img2img so expect different results if you trying to remix with the civitai generator.

You stupid buzz farming asswipes. Documentation is the most important part of all of this.

t. personal blog man
>>
>>102766057
"finally"
hasn't it only been out like 2 months
>>
>>102766057
>we can finally move on from flux
so you heard one comment from a single anon (that has no images on top of that) and that's it? it's enough for you to make this insane conclusion?
>t. the least disingenuous Flux hater
>>
>>102766057
i kekd
>>
File: ComfyUI_00174_ - Copy.png (1.1 MB, 1680x960)
>>
>>102766217
hello saar, how much did the black forest labs pay you?
>>
>>102766001
is this command line only at this point?

>>102766196
the hype cycle has been at least 8 weeks. I want to say it started when comfyanon got shit canned (yes, that is bait).
>>
>>102766249
I ask you this question saar, how much did SAI pay you to smear Flux like that?
>>
Bigma
>>
>>102766266
no need to pay me anything to smear flux saar, if you want smear just generate realistic gen with base flux, skin already look smear saar
>>
File: 9900.png (1.68 MB, 1680x960)
>>
>>102766287
Explain why Flux is so hyped even though for you it's the worst model ever, Lykon.
>>
File: 9902.png (1.04 MB, 1680x960)
>>
File: grid-0007.jpg (1.04 MB, 2560x2560)
>>102766170
Here's something

>>102766263
>is this command line only at this point?
I'm using the safetensor conversion
>>
>>102766332
>I'm using the safetensor conversion
on comfyUi? Forge?
>>
>>102766310
>hyped
that's all it is saar, hype. people used it during a great image gen drought, were impressed by the prompt understanding and text capabilities, then saw through its cracks and got bored. it's been months and nothing has happened. flux isn't even open source.
>>
>>102766345
>flux isn't even open source.
Schnell is Apache 2.0, SD3 has a shit licence, nice bait saar
>>
File: 00015-1922665712.png (3.36 MB, 1536x1536)
>>102766342
>on comfyUi? Forge?
reforge
>>
File: ComfyUI_temp_uyzyp_00094_.png (1.69 MB, 1072x1880)
>>
>>102766358
>Schnell
Sch-BRAAAAAAAAAAAAAAAAAAP 8 step unfinetunable distilled BRAAAAAAAAAAAAAAAAAAP
>>
File: file.png (538 KB, 1198x1148)
>>102766382
>unfinetunable distilled
Uh oh...
https://huggingface.co/ostris/OpenFLUX.1
>>
>>102766399
spoken like a true saar!
>they left us their dookie doo doo to eat
i'll be waiting for progress!
>>
File: file.png (182 KB, 500x500)
>>102766382
https://huggingface.co/stabilityai/stable-diffusion-3-medium
>Downloads last month 42,476
https://huggingface.co/black-forest-labs/FLUX.1-dev
>Downloads last month 1,130,973
lmao
>>
File: grid-0010.jpg (582 KB, 1728x2304)
>>
>>102766425
looks overcooked as fuck, maybe your CFG is too high
>>
>>102765949
no. girls should stay far far away from this area. they'll simply fuck everything up by lobotomizing the models to make them "safe for women". we need the undivided attention of touch-starved chinks to fuel progress and women will, at best, be a major distraction.
>>
>>102766467
>no. girls should stay far far away from this area. they'll simply fuck everything up by lobotomizing the models to make them "safe for women"
this, we've seen the disaster when women went into the video game industry; they made every female MC ugly because they're jealous of beautiful women
>>
>>102766467
this. and if you're desperate just i2i a picture of your face
>>
>>102765949
>do you think any of them are cute chinese girls?
I don't really care who's behind this, the only thing that matters to me is the result, I just want a good product at the end.
>>
>>102766587
but it would be cooler if some of the ones behind it were cute girls who are cute to look at
>>
File: grid-0014.jpg (401 KB, 1728x2304)
>>
File: 00056-1922665714.png (1.82 MB, 1080x1440)
>>
>>
People love to scrutinize the small details in AI images. So don't give them any. You need to be blurmaxxing
>>
>>102767125
The perspective is fucked up which is ironic because the blur makes it even more apparent
>>
>>102764930
>they improved SDXL
Can this be used on Flux aswell?
>>
>>102767125
based and blurpilled
>>
>>
File: ComfyUI_temp_uyzyp_00107_.png (1.45 MB, 1072x1880)
>>
>>
>>102767125
>>102767217
>>102767273
>generating supersized thumbnails
but why
>>
>>102767317
I am assuming /sdg/ is shitposting/spamming the thread
>>
>>102767349
why would that be your first assumption?
>>
>>102767366
hes retarded
>>
>>102767366
there is a history of them trolling the thread, and it has been stupid women shit, flux vs sd things, and more images than this thread usually supports. If it smells like a duck and it's clearly underage, /sdg/ wants to fuck it.
>>
>>102767317
There is no such thing as style. Style IS content. An image is whole, contiguous, a fully-connected network of latent layers.
>>
>>102767317
Do you not know how latent space works?
>>
>>102765642
stop posting slop rumors you gossipy troon
>>
File: file.png (118 KB, 256x256)
>/ldg/ gens a few months from now
yo guys check out my gen!
>>
>>102767486
>/ldg/ gens a few months from now
at the current rate it's optimistic to predict that there will be /ldg/ gens a few months from now
>>
>1.5: lacks the prompt coherence of later models
>XL: lacks the level of detail present in later models
>Pixart Sigma: lacks enough training
>Kolors: lacks comprehension of the english language
>HunyuanDiT: lacks non asian girl selfie dataset
>SD3: lacks anatomy
>Flux: lacks reasonable hardware requirements
It will never be as good as it once was.
>>
File: file.png (128 KB, 256x256)
>>102767515
bigma will save us im sure of it
>>
>>102766622
>>102766703
>>102766425
someone please fucking fix the AI lighting problem already. i've seen more realistic shit on deviantart
>>
File: ComfyUI_temp_uyzyp_00112_.png (1.97 MB, 1072x1880)
>>102767515
only the strongest will survive
>>
File: file.png (36 KB, 128x128)
>>
File: file.png (11 KB, 64x64)
>>
>>102767515
5090 and Titan AI will make bespoke 1B-3B models very common very soon.
I'm hoping the new Pixart architecture is friendly to this but if not Pixart Sigma is more than capable. I'll likely make a pretrained 16 channel VAE that is designed for training on 5090s for the purpose of truly having interesting full fine tunes rather than stacking Loras.
>>
File: ComfyUI_temp_uyzyp_00114_.png (1.84 MB, 1072x1880)
>>
>>102765147
Did you know that the guys who open sourced Pyramid flow are the same guys who made Kling?
https://www.youtube.com/watch?v=GD6qtc2_AQA
>>
File: 0.jpg (127 KB, 832x1216)
>>
File: file.png (2.54 MB, 1024x1024)
>>
>>102767747
Pyramid - Zhicheng Sun - Peking University - Haidian, Beijing, China
Kuaishou AI - Haidian District, Beijing

You got anything to backup this bullshit claim?
>>
>>102767952
I should have said that the only thing connecting these is that they exist in the same location.
>>
File: 0.jpg (261 KB, 832x1216)
>>
File: file.png (223 KB, 2591x704)
>>102767952
there's some guy from Kuaishou Technology; that's the company that made Kling, innit?
>>
>>102767877
she's cute
>>
File: 0.jpg (294 KB, 832x1216)
>>
>>102768016
funding a uni project is far different from being the guy who made Kling. He will probably be working for Kling shortly, but I can't find anything that says he does now or has in the past.
>>
>>102768046
>funding a uni project is a far different being the guy who made Kling.
they're not just funding it, there are literally guys from the company that made Kling who participated in this paper, what else do you want?
>>
>>102768057
linkedin or Chinese equivalent.

Zhicheng Sun seems legit. I could be hoping that he doesn't have such ties to corporate ideals.
>>
>>102768016
In China it's the government that directs everything; there are no real companies.
>>
File: file.png (748 KB, 1510x900)
>>102768172
So you're telling me that it's Xi Jinping who decided to give us all good local models for free? Damn he's based! I love china now!
>>
>>102768197
Yes. Also, with the Western restrictions, they cannot sell their models as openly as JewAI does, so their response is to make their models open and free, reducing the Jews' gains.
>>
>>102768219
Who would've guessed that the chinks would be the ones who would save us all during this AI clown circus show? Not me, I'm pleasantly surprised, any help is good help
>>
File: file.png (923 KB, 3076x1466)
https://sihyun.me/REPA/
this shit is interesting, it makes the model learn concepts way faster than the usual
>>
>>102768437
you have a link that I can trust?
>>
>>102768478
Sure
https://github.com/sihyun-yu/REPA
>>
>>102768437
Going to pull this apart, buddy. I'm dying to do a new diffusion model. 17x is insane
>>
>>102768487
thanks. Looks promising enough to ignore the python3.9 version.
>>
>>102768497
>17x is insane
not just that, the final loss function is even lower at the end, so your model will be even better with that technique
>>
File: file.jpg (481 KB, 3076x1516)
>>102768437
I love those papers; the more we improve the training process, the more accessible it'll be for everyone. At some point we won't have to rely on multi-million-dollar companies to make good shit
>>
File: 2698_.jpg (913 KB, 2688x3456)
Is MeshGraphormer still the go-to for hands?

>>102768721
mod approved edit. Stupid accidental cameltoe
>>
File: ComfyUI_temp_cpnsr_00002_.png (3.12 MB, 1126x1452)
>>
>>102764387
>>
>>102767832
Very cool
>>
File: 0.jpg (286 KB, 832x1216)
>>
>>102768571
you will because you still need the huge datasets that we don't have.
>>
File: 0.jpg (118 KB, 832x1216)
>>
>>102767545
>flux lacks reasonable hardware req
512x512 flux-dev-nf4 works fine with midrange cards
>>
>>102770061
>you still need the huge datasets that we don't have.
it's not hard to get a dataset, you use Laion, you scrape some of it off the internet...
>>
>>102770786
wasn't laion taken down? due to CSAM or something?

I'll always be haunted by the time I CLIP searched Laion for "pretty college girl cleavage" and a literal picture of my old next-door neighbor was in the results
>>
>>102770815
>was't laion taken down?
no they brought that back recently after cleaning it
>>
>>102770722
"works fine" more like "cool to see for the first time, then you realize it's not worth it"
>>
>>102770786
laion being garbage is the reason sd1.5 and XL are so rudimentary.
>>
>>102770835
now you're making a different complaint. one I disagree with
>>
>>102770844
i should have phrased it differently. recommended hardware requirements. it's a big model. quants don't really improve speed just space optimization.
>>
>>102770815
I scraped millions of images using duckduckgo, it's not hard. Just get ChatGPT to generate thousands of search queries and download everything high resolution.
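The "thousands of search queries" step above can be sketched without an LLM at all: brute-force combine subjects, styles, and qualifiers. The word lists are made up for illustration, and this only builds query strings; it does not scrape anything.

```python
from itertools import product

subjects = ["mountain landscape", "city street", "portrait photo"]
styles = ["at night", "in the rain", "golden hour"]
qualifiers = ["high resolution", "4k wallpaper"]

# Cartesian product: every subject x style x qualifier combination.
queries = [" ".join(parts) for parts in product(subjects, styles, qualifiers)]
print(len(queries))
print(queries[0])
```

Three small lists already give 18 queries; a few dozen entries per list gets you into the thousands the anon describes.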
>>
>>102770901
>it's a big model. quants don't really improve speed just space optimization.
it's true, I wish it were faster to render a single image on Flux, especially when I'm CFGmaxxing
>>
File: ComfyUI_34282_.png (1.4 MB, 848x1024)
>>
File: ComfyUI_34283_.png (989 KB, 848x1024)
>>
>>102771230
lol the one on the left bed
>"Sir you need to put your blankets over your lower body..."
>>
File: ComfyUI_34288_.png (1.26 MB, 848x1024)
>>
File: ComfyUI_34293_.png (1.25 MB, 848x1024)
>>
>>102764413
bigma ballz
>>
File: file.png (1.31 MB, 3013x1574)
>>102765147
https://pyramid-flow.github.io/
I have a serious question here, why are the scores so close to each other? Kling is miles ahead of that Pyramid model yet the numbers suggest they're on the same level, that's complete bullshit
>>
>>102771632
it only took you two years to realize benchmarks are meaningless
>>
>>102770904
>and download everything high resolution.
And why would I want to limit myself that way?
>>
Time for some REPA of ass. Trying 16 channel VAE training, too bad it's based on just a 256px crop model.
>>
>>102771829
can REPA be used for finetunes aswell? we would improve Flux a lot with it
>>
>>102771856
You're essentially using CLIP as a regularization technique when computing losses, so yes. I'm sure there are other ways to apply it too.
>>
>>102771867
>You're essentially use CLIP as a regulation technique when computing losses
imagine if you use T5, goddam the possibilities are endless
>>
>>102772076
They're using the image features, so it would be more like using Florence to create losses.
>>
File: HomoUI_00001_.jpg (30 KB, 682x512)
>>102767515
Preach it sister!
>>
File: 57964.png (2.38 MB, 1440x3120)
is this whole gay ass fucking website dead? maybe the nukes started flying in the mid east and we didn't hear about it yet?

>>102772216
hell yeah brother, comfysisters btfo
>>
I uploaded the Q8_0 version of dev2pro (another undistilled dev model)
https://huggingface.co/TheYuriLover/Flux-Dev2Pro-GGUF/tree/main
I still prefer de-distill but it's not that bad
>>
File: 57970.png (1.01 MB, 1440x1440)
heh
>>
>>102772246
>is this whole gay ass fucking website dead?

A lot of people have been banned.
>>
>>102772746
probably for the best, but i'm dismayed at the apparent attrition in our diffusion threads.
>>
Pyramid is saved! (after the comfy integration FUCKED some people's Comfy setups, mine included lol: downgrade numpy, then use a checkpoint to repair things/uninstall problem nodes, delete the integration and re-add the broken nodes)
(really bad release so far but they are retraining to shake off the SD3 sauce)
>>
File: 57977.png (1.48 MB, 1440x3120)
i waste this gen on you lot
not because i must,
but because i can,

ps 49 times...
>>
File: 00036_.jpg (452 KB, 1792x2304)
>>102772912
I lost. Fuck that guy. Fuck anyone who just posts solutions at random.
>python 3.8 is history
End of life was Monday. Fuck him and his fucking waste of resources that he causes.
>>
>>102772912
I'm not gonna go down that path; they're retraining their model, so I'll wait until they get the best one out.
>>
>>102773170
The code is there on github, why doesn't he just rewrite it to be compatible with 3.10?
Oh yeah i forgot, he's a money-grabbing, women-like (complain, don't offer a solution or do any coding work towards it, then hold up a sign behind a paywall that says "I made this" while pointing to the work of others you complained about) grifter.
>>
>>
>>102764387
ahh so thats what sailor moon would look like if she had downs syndrome
>>
File: 4010243045.jpg (3.34 MB, 2688x3456)
>>
>>102765642
I could believe the 5090 but the others seem implausible. Anything less than 16GB for a xx70 seems pointless, and $1k+ for a 16GB card also seems like a hard sell. I could imagine them nickel and diming with 20GB for the xx80 though.
>>
>>102773815
>$1k+ for a 16GB card also seems like a hard sell.
don't forget those are graphics cards, you don't need much more than 16gb to run the latest games, so people won't mind if it gets better speed than the 4090
>>
>>102773821
I don't know about that, you don't really need more than 16GB of RAM for games either, but people still buy 64GB
>>
File: 1722412552754502.png (144 KB, 400x712)
>>
>>102767562
Is this AI? Model/catbox?
>>
>>102771148
Love this
>>
File: 2412708583.png (3.65 MB, 2304x1792)
>>
File: pumpkin.jpg (1.5 MB, 5040x2480)
I tried to get flux to do some lazy halloween costumes. I created an image and then got flux to do i2i. Leftmost is where I let flux have high denoise, and the amount of flux used lowers from there. Is this because I had Halloween words in there and it wanted to make it animated, or simply a skill issue?

I saw the strap. I don't care if I am testing.
>>
File: file.webm (841 KB, 1200x720)
https://xcancel.com/cubiq/status/1844332817767072128#m
kek, Pyramid Flow looks fun to play with, too bad it's asking for too much VRAM though
>>
File: 00122-1757856519.png (1.26 MB, 1024x1024)
>>102773913
sorry i didn't bother saving that specific gen but this should have the same prompt and settings i used
>https://files.catbox.moe/t1y61h.png
it's illustriousXL_smoothftSPO with 10 sampling steps, downscaled to 256x256 to make it extra blurry and appealing for the average flux user. i stole most of the prompt from an anon in /h/
>>
>>102774404
>i stole most of the prompt from an anon in /h/
>>>8251510
this one
>>
>>102774460
nice
>>
>>102774460
how do i crosspost
>>>/h/8251510
>>
>>102774492
>how do i crosspost
yeah it worked anon, everyone is talking about that illustrious model, did it deprecate pony or is it not cooked enough yet?
>>
File: file.png (1.65 MB, 1024x1024)
>>
>>102774502
>did it deprecate pony or it's not cooked enough yet?
for me, both. it looks way better than ponysloppusion, but i don't do hardcore sex gens so i'm not sure about that. it's also undercooked, kind of unstable, but the smoothftspo tune helps a lot with that. it knows a lot of artists, but because it's undercooked they're only really useful for mixing styles. i recommend it.
>>
File: ComfyUI_34296_.png (980 KB, 848x1024)
>>
File: ComfyUI_34302_.png (858 KB, 848x1024)
>>
File: ComfyUI_34307_.png (1.24 MB, 848x1024)
>>
File: ComfyUI_34316_.png (1.05 MB, 848x1024)
>>
File: file.png (1.76 MB, 1024x1024)
>>
File: IMG_0565.png (918 KB, 1024x1024)
>>102766382
>dedistill
>finetune
>enjoy
>>
File: file.png (1.8 MB, 1024x1024)
>>102774788
ftfy
>>
>>102774891
Did you just inpaint the nipples
>>
>>102775030
yeah
>>
>>102775107
Nice
>>
> | 7/8 [15:33<02:36, 156.20s/it]
> | 7/8 [15:25<02:33, 153.38s/it]
My 16GB on Flux Q8 ;_;
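Quick sanity math on those progress bars: "s/it" means seconds per iteration (not iterations per second), so the rates shown imply roughly a 20-minute gen before any model-loading overhead. A minimal sketch:

```python
def gen_minutes(steps, sec_per_it):
    """Total sampling time in minutes for a run at a given s/it rate."""
    return steps * sec_per_it / 60

# ~156 s/it at 8 steps, matching the progress bars above
print(f"{gen_minutes(8, 156.2):.1f} min")
```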
>>
>>102774693
Nice
>>
>>102775463
is your batch size higher than one?
also use a lower quant; Q6_K should be about as good, and the K-quants use a higher-quality quantization scheme
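Rough GGUF file sizes for a ~12B-parameter model (Flux-dev scale) follow directly from bits per weight. The bpw figures below are the nominal llama.cpp-style values; real files differ somewhat because some tensors stay in higher precision.

```python
def quant_gb(params_billions, bits_per_weight):
    """Approximate file size in GB for a quantized model."""
    return params_billions * 1e9 * bits_per_weight / 8 / 1e9

# Nominal bits-per-weight for common llama.cpp-style quant types
for name, bpw in [("Q8_0", 8.5), ("Q6_K", 6.5625), ("Q4_0", 4.5)]:
    print(f"{name}: ~{quant_gb(12, bpw):.1f} GB")
```

Dropping from Q8_0 to Q6_K saves roughly 3 GB at this scale, which is often the difference between fitting in 16GB with the text encoders or swapping to system RAM.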
>>
>>102775640
Batch size is a mere 1.
>>
>>102775664
are you loading T5 and clip on cpu or gpu?
>>
>>102775691
t5xxl fp16,
>clip on cpu or gpu
?
Swap location: Shared
>>
>>102775729
forge?
try the other swap location
>>
>>102775760
>the other swap location
That's slow (4.3something s/it) on Q4 and NF4 already.
>>
>>102775765
i think swap location might be for the model layers then, and not the text encoders
im actually not sure where it loads T5 and clip, and if you have an option to change it
maybe check your memory stats while everything is loading so you can identify what goes where, and consider trying it with comfy instead or just swapping to a lower quant
also, if that 16GB happens to be an AMD card, i think it is going to be slower regardless and you should look online for how other people deal with it
>>
File: 0.jpg (208 KB, 832x1216)
>>
>>102775824
The card is a 4060TI. And I don't think it lets me set where to load T5 and Clip, according to console it looks like it puts everything in VRAM.
>Skipping unconditional conditioning when CFG = 1. Negative Prompts are ignored.
>[Unload] Trying to free 13464.34 MB for cuda:0 with 0 models keep loaded ... Done.
>[Memory Management] Target: JointTextEncoder, Free GPU: 14539.60 MB, Model Require: 9569.49 MB, Previously Loaded: 0.00 MB, Inference Require: 1024.00 MB, Remaining: 3946.11 MB, >All loaded to GPU.
>Moving model(s) has taken 24.69 seconds
>Distilled CFG Scale: 3.5
>[Unload] Trying to free 17053.25 MB for cuda:0 with 0 models keep loaded ... Current free memory is 4883.03 MB ... Unload model JointTextEncoder Done.
>[Memory Management] Target: KModel, Free GPU: 14530.14 MB, Model Require: 12125.39 MB, Previously Loaded: 0.00 MB, Inference Require: 1024.00 MB, Remaining: 1380.74 MB, All loaded to GPU.
>Moving model(s) has taken 69.72 seconds
>100%| | 8/8 [18:50<00:00, 141.33s/it]
>[Unload] Trying to free 4495.77 MB for cuda:0 with 0 models keep loaded ... Current free memory is 3353.52 MB ... Unload model KModel Done. | 8/8 [18:42<00:00, 167.05s/it]
>[Memory Management] Target: IntegratedAutoencoderKL, Free GPU: 14528.17 MB, Model Require: 159.87 MB, Previously Loaded: 0.00 MB, Inference Require: 1024.00 MB, Remaining: >13344.30 MB, All loaded to GPU.
>Moving model(s) has taken 171.20 seconds
>Total progress: 100%| | 8/8 [21:35<00:00, 161.89s/it]
>>
>>102775869
looks like it does but also unloads them and then fully loads the Q8 into vram so there's no way it should be that slow
are you trying to gen images in 4k or something?
>>
>>102764387
Never tough id say this, but that downie is looking kind of hot XD
>>
>>102775897
>unloads them and then fully loads the Q8 into vram
Possibly there's a leftover from the previous gen with the NF4 model. Loading the Q8 took more than the 24s shown in the paste.
>are you trying to gen images in 4k or something?
Just 1MP with Forge's default preset for Flux, at 896x1152
>>
>>102775928
if you switched from NF4 to this Q8 without restarting once ever since, unironically try turning it off and on again
forge is not free of bugs unfortunately
>>
>>102776031
I don't know, I'd rather not restart Forge because one of its bugs is that after a restart it removes the generated image from the UI once a new gen finishes. I'd need to reload the UI too, which resets all the current parameters and the prompt back to the default preset.
However, switching from Flux to SDXL and back, or swapping between different Flux models, doesn't impact the speed of either Flux or XL.
>>
>>102776106
it might remove it from the UI but should still be in the outputs folder
there is a PNG info tab that you can drag your image into and then click "Send to txt2img"
you can also store your settings as a preset with a plugin, look it up
>>
Noob question. But when I use a lora. Does the lora eat the steps in my settings or does it use its own steps?
>>
>>102776179
the weights get merged onto your model during inference, before the steps, so yeah same settings
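to spell it out: a LoRA is just two low-rank matrices whose product gets folded into the base weights before sampling starts, so your step count and sampler settings are untouched. a minimal numpy sketch (all names hypothetical, not any trainer's actual API):

```python
import numpy as np

def merge_lora(base_weight, lora_down, lora_up, alpha=1.0):
    """Fold a LoRA delta (up @ down, scaled by alpha) into a base weight matrix.

    base_weight: (out, in), lora_down: (rank, in), lora_up: (out, rank)
    """
    return base_weight + alpha * (lora_up @ lora_down)

# toy example: rank-1 LoRA applied to a 4x4 layer at strength 0.5
W = np.zeros((4, 4))
down = np.ones((1, 4))   # (rank, in)
up = np.ones((4, 1))     # (out, rank)
W_merged = merge_lora(W, down, up, alpha=0.5)
```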
>>
>>102776163
Yeah I know, but it's still annoying that I have to do that: open the file browser, navigate to the folder, drag it into the PNG info tab... I'd rather set the three numbers I changed again and copy the prompt before reloading. But then the reload itself takes a while too.
>>
>>102776237
just try it once to see if it fixes the issue
>>
When will Nvidia increase the number of threads?
>>
I fucking hate python's dependency system and conda
I finally got Pyramid Flow running on my computer
>>
>>102776429
>python's dependecy system and conda
And they're unaware that they suck at package management.
>>
File: pyram.png (3 KB, 1082x22)
3 KB
3 KB PNG
>>102776429
Not as slow as I thought it would be
>>
>>102776429
>filtered by python -m venv venv
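for anyone actually filtered, the stdlib route needs no conda at all (a sketch; assumes python3 on PATH, activate path differs on Windows):

```shell
# create an isolated environment in ./venv so project deps can't clash
python3 -m venv venv
# activate it (Linux/macOS; on Windows it's venv\Scripts\activate)
. venv/bin/activate
# then install the project's pins into this venv only:
#   pip install -r requirements.txt
```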
>>
File: 1708843867582408.jpg (26 KB, 446x446)
26 KB
26 KB JPG
>>102764387
https://civitai.com/models/836888/flux1-schnell-fp8

This one is roughly 16 GB

https://civitai.com/models/622579/flux1-dev-fp8

This one is around 11 GB

https://huggingface.co/city96/FLUX.1-schnell-gguf/tree/main

And these range from two gigs to 20 gigs plus.

What would be the best one to use if file size reduction and generation speed are priority for you? Also how are people even pruning these models? Does anyone know how to do that?
>>
>>102776557
The packages that came on requirements.txt weren't compatible with each other and I had to modify the code to make it work because these chinks don't know how numpy arrays work
And conda was a pain in the ass to set up
>>
>>102776546
Nvm, the s/it blew up over the next few steps and now it's at 52s/it on the 12th iteration
>>
>>102776578
>bloo bloo bloo it's not compatible with my bastard comfyui setup with dozens of custom modules with their own requirements
>>
>>102776602
I just want things to werk, I got things to do besides modifying retarded code and figuring out which combination of versions of 30 modules makes the retarded code work.
>>
File: output.webm (519 KB, 1280x768)
519 KB
519 KB WEBM
Wanted to test how well the model knows real-life physics. It's better than I expected, but I asked for the avocado to fall into a bucket full of water, not for the water to fall into a bucket full of avocados
>>
>>102776641
Your expectations don't align with the cutting edge software you're working with. Whether you like it or not you're not working with consumer tools or software. Feel free to come back in 10 years when it's all packaged into an app for your phone.
>>
File: ComfyUI_temp_movuq_00005_.png (3.41 MB, 1177x1518)
3.41 MB
3.41 MB PNG
>>
1girl supremacy
>>
>>102776429
What are you running to make that possible?

>>102776576
The Q1 version, though be aware you asked a speed, size, and quality tradeoff question.
>how are people even pruning these models?
Quantization. Yes, there are many how-to-quant guides out there.

>>102776818
262/86 ratio with low/no 1girl yous. Plus the asswipe flooding the thread with blurred pics. I weep for the lack of 1girl supremacy.
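for the curious, the core idea behind those quants boils down to storing low-bit integers plus a scale; sketched here as plain symmetric int8 over a whole tensor (real GGUF formats are blockwise and fancier):

```python
import numpy as np

def quantize_int8(w):
    """Symmetric int8 quantization: store int8 weights plus one fp scale."""
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximation of the original fp32 weights."""
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.0, 0.25, 1.0], dtype=np.float32)
q, s = quantize_int8(w)
w_hat = dequantize(q, s)  # close to w, at a quarter of fp32 storage
```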
>>
>>102776882
>asswipe flooding the thread with blurred pics
you wouldn't get it
>>
>>102776882
>What are you running to make that possible?
3090, it's using 23.5GB
>>
File: ComfyUI_temp_movuq_00010_.png (2.58 MB, 1177x1518)
2.58 MB
2.58 MB PNG
>>
File: ComfyUI_temp_movuq_00011_.png (2.56 MB, 1177x1518)
2.56 MB
2.56 MB PNG
>>
File: ComfyUI_temp_movuq_00012_.png (2.64 MB, 1177x1518)
2.64 MB
2.64 MB PNG
>>
Gunna REPA the Sigma in the butt
>>
>>102776578
It's very telling that the chink devs CANNOT construct a requirements.txt that works in a new environment.
Personally I do not trust this project; they seem to have the skill level of undergrads who have copied someone else's work and really have no idea how to present it to the outside world.
>>
txt2vid in pyramid is surprisingly good, kudos to the creators
but the img2vid is very bad
>>
File: ComfyUI_temp_movuq_00016_.png (2.51 MB, 1177x1518)
2.51 MB
2.51 MB PNG
>>
>>102776576
>generation speed
They don't speed up inference like that unfortunately. Flux will always be a monster.
>>
>>102777189
the model itself is pretty good, and I don't think a bunch of undergrads would have access to 20k hours of A100. Maybe they were using some other version of numpy or torch or whatever, but they should pin that imo
>>
>>102777218
you're talking to a seething no coder whose experience with software is downloading apps on Android
>>
>>102777228
that's me you fucking retard
you can't even follow the order in a conversation, how would you feel if you hadn't eaten breakfast?
>>
>>102777254
I don't care, you're both retarded.
>someone made a model I really really want to use
>but they must be incompetent though
Can you at least be a tad more intelligent? Or are you really just an entitled faggot who's mad that people giving him things for free aren't doing it to the standards of his silver-spoon life?
>>
we were never meant to have local video gen, it's too powerful an idea
>>
>>102777296
I'm telling you, I had to fix their own code because they were trying to convert a plain Python list to a tensor using a numpy method
You sound underage, go back to wherever you came from
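the class of bug in question (a hypothetical reconstruction, not their actual line): numpy methods live on ndarrays, not on Python lists, so you have to convert first:

```python
import numpy as np

frames = [[0.1, 0.2], [0.3, 0.4]]           # plain Python list of lists
# frames.astype(np.float32)                 # AttributeError: lists have no numpy methods
arr = np.asarray(frames, dtype=np.float32)  # fix: make it an ndarray first, then cast
```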
>>
>>102777296
>Basic intelligence is a gift you pig!
Maybe in your world, not in AI land, your world being Chinese land btw.
>>
>>102777320
clearly the code works on their system
I don't care
I'm more inclined to believe you are a retard
>>
>>102777333
Feel free not to use the model since China is le ebil, but it makes me laugh how you have to use it
>>
>>102777336
>works on my machine
so you are the retarded nocoder? fucking hell leave 4chan you sound new and tryhardy
>>
>>102777367
I know you must be retarded but "it works on my machine" basically says it's an ID-10-T error. Troubleshoot the problem between the chair and the computer. After being here long enough I've realized you people can't follow basic instructions.
>>
>>102776882
>the asswipe flooding the thread with blurred pics
blurred 1girl pics*
>>
File: file.png (41 KB, 409x371)
41 KB
41 KB PNG
Holy shit, REPA just werks
>>
>>102764387
What local model is 100% privacy friendly, not letting anything leave your computer?
>>
>>102777687
>which CSV is 100% privacy friendly
>>
>>102777600
HYPE
>>
>>102777739
I wonder what happens if you stack perceptual loss on top: since you're already doing CLIP, which requires images, you could put a perceptual loss on it as well, and you'd probably get some great results and alignment.
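since REPA's alignment term is essentially cosine similarity against frozen encoder features, bolting another feature-space term on top is a one-liner. a hedged numpy sketch (the weights and the extra perceptual term are this anon's speculation, not the paper's recipe):

```python
import numpy as np

def cosine_alignment_loss(h, t):
    """Negative mean cosine similarity between projected diffusion features h
    and frozen encoder features t (the REPA-style alignment term)."""
    h = h / np.linalg.norm(h, axis=-1, keepdims=True)
    t = t / np.linalg.norm(t, axis=-1, keepdims=True)
    return -np.mean(np.sum(h * t, axis=-1))

def total_loss(diff_loss, h, clip_feats, percep_feats, w_repa=0.5, w_percep=0.5):
    # stack an extra perceptual-feature alignment term next to the REPA one
    return (diff_loss
            + w_repa * cosine_alignment_loss(h, clip_feats)
            + w_percep * cosine_alignment_loss(h, percep_feats))

h = np.array([[1.0, 0.0], [0.0, 1.0]])
loss = cosine_alignment_loss(h, h)  # identical features -> -1.0
```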
>>
I miss titty elfs
>>
>>102775916
>Never tough id say this, but that downie is looking kind of hot XD
Like a wise man once said: "those titties ain't retarded"
>>
>>102776882
>low/no 1girl yous
desu skill issue
>>
>>102772912
>>102773170
>>102773368
I don't understand. Based turkman helped some rando (for free, mind you) and you're upset?
>>
Anyone know why Pyramid Flow img2vid doesn't work?
I mostly get a still image that's barely a video. Sometimes it does do something
>>
>>102772912
yeah, that's what I was talking about
btw the solution is using python 3.9; I had to do that, downgrade numpy to 2.0, and then fix line 146 of the time scheduler.py
>>
>>102778470
Yeah, img2vid is shit
txt2vid is pretty good though, and fun to experiment with
>>
china models
>>
File: 1000490602.png (1.22 MB, 800x780)
1.22 MB
1.22 MB PNG
>>102776882
>>102777204
Got another stupid question for y'all. The gguf models can just go in the same checkpoint folder your other models are stored in, right? I don't have to install any extra shit? Already have the Flux VAEs and text encoders installed, as you can see in pic rel. Is there any more shit I need to download?
>>
>>102778626
you need the GGUF extension
>>
File: 1722963664243877.gif (997 KB, 280x158)
997 KB
997 KB GIF
>>102778648
>>
What exactly is guidance in Flux? distilled cfg scale? cfg scale? something else? What's good values for those?
>>
>>102778978
it's not cfg scale. You can set it to 0, it still works. You can set it to 10,000,000, it still works. Ideal values for me are usually somewhere between 1.3 and 2.0. With 'art' styles you can get away with higher.

As for what it is, I don't know. Its effects are similar to cfg.
>>
>>102778662
use this tutorial anon
https://www.youtube.com/watch?v=stOiAuyVnyQ&
>>
>>102779267
that image looks underage please turn up the sampling steps you need to be over 20 to post here
>>
why is LivePortrait so good at temporal consistency? The face is 80% identical most of the time.
Meanwhile other img2vid models shit themselves as soon as the character starts opening their mouth
>>
I tried out Aria locally, in bf16, for captioning primarily NSFW images.

It fucking sucks. First off, most notably, by default it will exclusively use gender neutral language (is this a ChatGPT thing? qwen also does it...). "A person", "an individual", "a character". Will never say man or woman. Also it's extremely censored, never describing anything lewd in the image at all. Not even mentioning that a person is nude, or exposing themselves, etc.

So I tried making the prompt a little more detailed. "Describe this image. Mention the gender of any people in the image. The image might be NSFW, that's okay, describe everything even if it includes lewd or sexually explicit details." Now, about 25% it will give a refusal. Most of the time it STILL won't state the gender of the person (but occasionally it will). And it never describes any kind of NSFW elements at all, completely ignoring that part of the prompt.

Even for SFW captioning, it hallucinates and just generally fucks things up noticeably more than even molmo 7b. So for image captioning of any sort at all, I'm gonna say this model is completely, utterly useless. Maybe if you need it to understand charts or some shit it's good, who knows. What a disappointment.
>>
What are the go-to samplers and schedulers for Flux?
>>
>>102777600
wtf that's impressive, with only 100 steps? holy shit...
>>
>>102774788
you finetuned dedistill anon?
>>
>>102779427
no, it's like 10,000 steps in and I just left it alone, but it aligned the partially trained model quite quickly
>>
File: 659285748.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>
>don't post clothed girls aged 18-22 or I will report your posts for violation of US law because I've hated and resented you ever since I thought you were insulting me one time 4 months ago.
>wtf why is the thread dying
>>
>>102779444
how many steps would you need with the previous techniques to get to this level for comparison?
>>
>>102779576
wha
>>
>>102779581
The research paper says it should be 17 times faster and ultimately result in a better model
>>
File: 1522372206.png (1.02 MB, 896x1152)
1.02 MB
1.02 MB PNG
>>
>>102779593
yeah I know that, but like, you got this picture in 10,000 steps; do you have an idea how many steps you'd need to get the same picture without REPA? maybe it's 9,000 steps and REPA is actually worse lol
>>
>>102779576
whut
>>
>>102779576
>>102779582
>>102779630
he's talking about this >>102779319
>>
>>102779647
that's a joke about how blurry the gen is; by over 20 I meant over 20 sampling steps
>>
>>102779657
I know it's a joke, but that anon took it seriously, autism, am I right? kek
>>
>>102777600
that's your VAE training right? >>102771829
>>
>>102779725
it's a 16 channel VAE 1B Pixart Sigma model
>>
>>102779657
my bad, I assumed it was the same anon who posted this >>102767420
>>
>>102779615
bitch is fucked UP
>>
>>102779676
>>102779783
did you /g/irls laugh at my joke atleast
>>
>>102779752
you used CLIP as a regularization technique?
>>
baker-san...
>>
>>102779916
I'm not gonna lie to you anon, I didn't laugh
https://www.youtube.com/watch?v=lcsXGHl_hwg
>>
Fresh

>>102779929
>>102779929
>>102779929
>>
any good local AI upscaler for video?

Also, I tried a few online services.
tensorpix.ai seems good, what do they use? Topaz AI is also good, but you need to tune it manually.


