/g/ - Technology
File: the longest dick general.jpg (2.69 MB, 3264x1562)
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102956911

Phrenologic Exploration Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
i fucking hate flux fuck fuck fuck thread those fucking bfl snakes
>>
File: image (9).png (435 KB, 1024x1024)
>>
File: 00007-2059676223.png (1.28 MB, 832x1216)
>>
yep, it's sloppin' time
>>
File: image (10).png (378 KB, 1024x1024)
>>
>>102964636
don't be that jealous anon, it's not good for your mental health
>>
File: image (5).png (407 KB, 1024x1024)
>>
File: 1727216024.png (825 KB, 1024x1024)
>>
>>102964712
jealous of licking bfl's dirty butthole, hoping, no, PRAYING, that one of the two fossilized shits they shat out will be usable if we clean it with enough money?
>>
File: 00037-1868180960_cleanup.png (1.21 MB, 832x1216)
>>
>>102964744
>one of the two fossilized shits they shat out will be usable if we clean it with enough money?
isn't that what happened with SDXL though?
>>
>>102964753
yes the 2.6b model, wonder how much it will cost to unshit a 12b
>>
Why is ldg quality so low now? What changed?
>>
File: 1727215035.png (1.19 MB, 1024x1024)
>>
0/10
>>
flux/10
>>
File: 1726986290.png (1.54 MB, 1024x1024)
>>
>>102964760
>unshit a 12b
there's not much to "unshit" though, flux is already great as a base model, it just needs more concepts, the work is less significant than SDXL where we had to unfuck the anatomy
>>
File: image (2).png (394 KB, 1024x1024)
>>102964786
>>
>>102964789
>flux is already great as a base model
flux dev is, problem is the flux dev licence is dogshit, any work you do on flux will be legally owned by bfl. if anyone's going to pour money on flux it will be schnell
>>
File: 1729643888_0001.png (1.16 MB, 1024x1024)
>>
>>102964789
Any new concept requires thousands of steps and you have to do it as part of a full train with hundreds of thousands of images, it's not cheap or easy
>>
>>102964807
>flux dev is, problem is the flux dev licence is dogshit,
I don't want to sound like a doomer, but we'll never get a great base model with the goated Apache 2.0 licence, even if it's the case for Mochi, it's only the 480p version, the HD version will probably have a shit ass licence too :(
>>
File: 1729309735.png (1.63 MB, 1024x1024)
>>102964804
>>
>>102964818
my point is that it's more trivial to add Gawr Gura onto a model as a concept than trying to improve fucked up anatomy, yeah you need money but it's not like everyone that trains models does it for the money, some people just want to improve a model and share their results with everyone
>>
>>102964820
yeah.. but the sd 3.5 license is pretty decent, research and non-commercial use is free and you only need to pay for commercial use if you make more than $1 million in revenue annually, so fingers crossed medium turns out to be good
>>
>>102964849
Loras only get you so far
>>
>>102964856
>yeah.. but the sd 3.5 license is pretty decent
not decent enough for the pony fag, if he turned down SD3.5 it means that this fucker made more than a million out of ponyXL, which is absolutely insane
>>
>>102964862
I never said anything about a lora, could be a finetune that simply adds a shit ton of characters from danbooru or some shit, you know it's gonna work fine on Flux because adding characters is easy enough if you have enough pictures of them, fixing the anatomy on the other hand is another story, and thankfully we don't have to deal with the hard concepts on flux, flux is good on the hard concepts, but not so good at trivial ones
>>
>>102964868
ponyxl was a shittily trained model anyways, i'd rather someone like the illustrious guys try a new finetune instead. and that's why lower param count models are better, because it makes finetuning more accessible to smaller groups and you don't have to rely on retard grifters like astralite for finetunes as much. big parameter counts aren't everything
>>
>>102964868
>if he turned down SD3.5 it means that this fucker made more than a million out of ponyXL
Or he's just happy continuing with whatever he does
>>
File: file.png (271 KB, 1633x1555)
>>102964897
I get that, I wish we could get Flux dev quality on a 0.1b model, but that's just utopia at this point, the scaling law is a thing and a small model's limit will always be lower than a bigger one's
https://dynomight.net/scaling/
>>
>>102963916
This will be great for merges. Seems way better than the previous version. I wonder if it could be used as base for lora training
>>
File: 02163.jpg (1.73 MB, 1792x2304)
>>
>>102964917
you need to remember that image gen is nowhere near the limits of the transformer architecture like llms are, the text gen space is filled with mega corporations pouring billions into squeezing out every possible point out of whatever benchmarks they are testing on, openai, anthropic, meta, qwen, whatever the fuck. image gen meanwhile is a much smaller market, filled with much smaller companies fiddling around with much smaller models and comparatively tiny compute. i don't see why we couldn't get flux level quality in a way smaller parameter count. maybe not 0.1b obviously but 5b? sure.
>>
>>102964980
Isn't DALLE something like 4B?
If so, knowing how good it can look, pretty sure we haven't seen anything yet.
>>
>>102964980
>i don't see why we couldn't get flux level quality in a way smaller parameter count. maybe not 0.1b obviously but 5b? sure.
I agree with that number, a perfectly trained 5b model could match flux dev, and that's why I believe Sana is gonna miss the mark, SD3M too, we need something medium, otherwise it's either little shits or this giant Flux model, there's still improvements to be made, and I won't forgive SAI for giving up on SD3-4b
>>
File: image (3).png (436 KB, 1024x1024)
>>
>>102964988
>Isn't DALLE something like 4B?
no clue, but openai barely cares about dalle because their main product is gpt and image gen is just too much legal trouble for little potential profit, and it still mogs everything else out there. so i think there's way more potential optimizations and research to be made for image gen, flux is just the tip of the iceberg.
>>
>>102965024
>openai barely cares about dalle
that's funny you say that because yesterday they improved the turbo distillation
https://xcancel.com/OpenAI/status/1849139783362347293
>>
>>102965024
>it still mogs everything else out there
it's probably because not only did they not curate nsfw out of their dataset, their synthetic captions are probably good too because they used a good vlm
>>
>>102965041
obviously they are going to make optimizations to make running dalle cheaper, but it's not their main focus is what i meant.
>>
>>102965055
>curate nsfw from their dataset
still pissed off that it seems all current releases start with a "safety" paragraph just meaning "ok so we got rid of all the useful anatomy data that is porn, because who knows people could generate that"
>>
File: 02166.jpg (2.1 MB, 1792x2304)
>>
>>102965088
what if you went outside and saw this thing up in the sky trying to inhale the earth
>>
File: image (16).png (438 KB, 1024x1024)
>>
File: 00040-1254627563.jpg (438 KB, 896x1152)
>>
>>102965078
>still pissed off that it seems all current releases start with a "safety" paragraph just meaning "ok so we got rid of all the useful anatomy data that is porn, because who knows people could generate that"
I was born too soon desu, in 20 years it'll be the norm to release uncucked models because there's always someone with balls who will dare to do it first and show the masses that it's not the end of the world
>>
>>102965127
It's mostly a matter of compute.
And time is with us on that.
>>
The number of steps definitely has an impact
>bf16 64 steps
https://files.catbox.moe/z5r6yu.webm
>bf16 200 steps
https://files.catbox.moe/qvy7jx.webm
>>
>>102965160
https://github.com/genmoai/models/issues/2#issuecomment-2434139372
>While in preview, Mochi 1 only supports text-to-video. As a quick hack, you could describe the image extensively as the T5 XXL encoder supports long prompts, but that is a suboptimal solution since it'll lose most of the detail. We know I2V is important for the community so stay tuned.
nice, a model that can't do image2video is shit desu
>>
>>102965160
https://www.reddit.com/link/1gbg4ot/video/dm9ktw05cswd1/player
30 minutes
https://old.reddit.com/r/StableDiffusion/comments/1gbg4ot/mochi_1animation_24gb_vram_fp8/
I was planning to test Q8 today but my day has started early :(
>>
File: file.png (43 KB, 810x573)
ok so I think if we want local Mochi to be as close as possible to the demo, you should go for 163 frames, and 30 fps, looks like these are the settings the model has been trained on the most, I'll provide a video once it's done
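As a quick sanity check on settings like these (a sketch, not from anyone's posted workflow — the 6x temporal compression is an assumption inferred from every frame count reported in this thread being of the form 6k+1):

[code]
# Sanity-check Mochi render settings before committing to a 40min gen.
# Assumption: Mochi's VAE compresses time 6x, so frame counts of the
# form 6k+1 (61, 67, 73, ..., 121, 163) map to whole latent frames --
# every count anons report in this thread fits that pattern.

def check_settings(frames: int, fps: int) -> None:
    duration = frames / fps
    aligned = (frames - 1) % 6 == 0
    print(f"{frames} frames @ {fps} fps = {duration:.2f}s "
          f"({'aligned' if aligned else 'NOT aligned'} to 6k+1)")

check_settings(163, 30)  # demo-like settings: 5.43s
check_settings(141, 25)  # the Minimax comparison below: 5.64s
check_settings(67, 30)   # a cheaper test run
[/code]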
>>
>>102965286
If you can do multigpu, maybe it's possible with 2x3090.
>>
>>102965195
good that they're working on that, it's so much easier to start with an image you already genned or just want to animate
>>
File: file.png (88 KB, 1489x540)
>>102965299
it's already possible if you go for Q8, it asks for 19gb of vram
>>
File: 1729835388.png (565 KB, 522x672)
>>
File: file.png (42 KB, 741x521)
>>102965286
just as a comparison, Minimax videos are running on 25 fps + 141 frames
>>
>>102965286
>Mochi
>163 frames, and 30 fps
5.43 sec
>>102965337
>Minimax
>25 fps + 141 frames
5.64 sec

really small difference kek
https://www.youtube.com/watch?v=ekkSkdWt0mQ
>>
>>102965286
30fps seems to be what people render at, whether it's the temporal speed of the model or not we don't actually know.
>>
>>102965376
>whether it's the temporal speed of the model or not we don't actually know.
it is the temporal speed of the model, because the script tested a video straight from the API demo of Mochi
>>
>>102965383
Thanks. I didn't know that.
https://files.catbox.moe/5uu88o.mp4
>>
File: 00098-1254627564.jpg (330 KB, 1152x1536)
asp2 wrangling
>>
File: 00170-1254627563.jpg (504 KB, 1728x1344)
>>
I've been generating a lot of images and I'm pretty sure messing with the Flux sigmas just gets you less intelligent results (more mistakes).

I don't think I'll be touching that stuff anymore.
>>
The most annoying thing about genmo is you can absolutely see the potential but it's just not good enough
>>
>>
This one is almost good

>Under the ultraviolet pink and blue glow of holographic advertisements, two young Russian teenage girls in sleek, angular white outfits stand and pose on a high-rise rooftop. The sprawling cityscape sprawls beneath them, filled with pulsing neon lights. The camera captures their movements and beautiful face in slow motion as rain begins to fall, creating a shimmering effect on the rooftop

Maybe "teenage" is what their captioner used
>>
>>102966406
perhaps smaller resolution gives better results?
>>
Is it correct to say that RTX 3090 (24GB) and RTX 4090 (24GB) cannot run the original 24GB flux1-dev.safetensors at full speed?

Because I don't get full speed on my 3090, I have to use the 12GB quant. Is this just a problem with my setup?
>>
>>102966437
>perhaps smaller resolution gives better results?
I wouldn't know, I'm using the website so I can't customize anything, but there's no point in generating anything smaller than 720p for my usecase anyways
>>
>>102966578
have you tried this yet? requires pretty beefy gpu tho
>https://github.com/kijai/ComfyUI-CogVideoXWrapper
>>
I replaced "angular white outfits" with "blue leotards" and I got belly. Looks like ages are still inconsistent though


>>102966598
CogVideoX doesn't have a high enough framerate unfortunately
>>
>>102965708
I don't know it works pretty good for 1seagull gens
>>
File: 2024-10-25_00003_.png (1.58 MB, 1024x1024)
>>
>>102964980
>image gen meanwhile is a much smaller market, filled with much smaller companies fiddling around with much smaller models
Because size does not matter, Dalle 3 outclasses Flux at 1/3 of the size.
>>
File: 1725174399046033.png (812 KB, 883x911)
sd 3.5 likeness seems very bad compared to flux
>>
File: 2024-10-25_00004_.png (1.75 MB, 1024x1024)
>>102966720
>>
>>102966821
could be user error
>>
>>102966856
Also, Emma has changed a ton. She's unrecognizable now.
>>
LOAD ALL THE LORAS
>>
File: 2024-10-25_00005_.png (2.13 MB, 1024x1024)
>>102966845
Dishonesty at -.05 obliterates the image. Actually, it looks like a Canon 1DX at expanded iso.
>>
File: 00350-1254627563.jpg (489 KB, 1120x1440)
>>
>>102966659
why is it so BLURRY
>>
31 minutes on 4060ti 16gb GGUF_Q8_0 mochi 67 frames 64 steps.

https://files.catbox.moe/0fkt54.mp4

Going to use this prompt and see how it comes out >>102966406
>>
File: 2024-10-25_00008_.png (1.14 MB, 720x1280)
>>
File: 00147-1620015057.jpg (1.16 MB, 1408x2064)
Mixuco!
>>
File: 2024-10-25_00010_.png (1.71 MB, 720x1280)
>>102967195
>>
>>102967116
>why is it so BLURRY
Because videos are blurry
Maybe it's overtrained on bokeh like base flux is
>>102967122
>Going to use this prompt and see how it comes out
Since it's been more than 31 minutes now I'm assuming your result wasn't good enough to post (or too young)
>>
I've seen some pretty good results coming out of minimax/hailuo. Is there a way to run it locally in a similar fashion to comfyui?
>>
>>102967505
To get something as good as minimax/hailuo, you'd need 100GB+ vram, but also the model.
There is nothing as good locally, nor the ability to run it, and so far I've not been impressed by the blurry mess posted here for Mochi 1.
>>
File: ComfyUI_02636_.png (1.25 MB, 1024x1024)
>>
File: ComfyUI_02598_.png (1.56 MB, 1024x1024)
>>
>>102967486
nah i had a shower and made homemade pizza!
Turned out reasonably ok, for what it is, Q8 and all.

here you go:
https://files.catbox.moe/0et1cj.mp4

Prompt executed in 1811.06 seconds
>>
File: ComfyUI_02651_.png (1.55 MB, 1024x1024)
>>
File: 1418041423282.jpg (20 KB, 306x306)
>thought I'd have more control over facial features to be be able to better create individual looking characters in Flux
>nah
>>
I want my specific fetish to be in the next big AI model.

I have a manually tagged dataset of 1000 pics.
Which website do I upload it to, to make sure it will be added in the next database scrape?
danbooru.donmai.us ?
>>
File: 3024981597.png (1.36 MB, 1344x768)
>>
File: ComfyUI_01596_.png (1.27 MB, 1024x1024)
>>
File: 2024-10-25_00013_.png (1.4 MB, 720x1280)
>>
i demand a chibi migu in a suit
now
>>
File: ComfyUI_01600_.png (1.82 MB, 1024x1024)
>>
File: 1876002013.png (1.35 MB, 1536x640)
>>
File: 1712539816.png (1.07 MB, 1536x640)
>>
File: ComfyUI_01607_.png (1.02 MB, 1024x1024)
>>
File: file.png (491 KB, 448x544)
>>
cozy thread
>>
>>102964600
>Model Ranking
>https://imgsys.org/rankings
wow SD3.5 not even on the list, it must be really shit compared to flux then
>>
File: 306218340094574596.webm (899 KB, 1248x720)
>>
>>102968542
it's okay to be gay
>>
>>102968530
cozy thread to be mass reported
>>
File: file.png (307 KB, 448x544)
>>
>>102968633
clowns r funny :)
>>
>>102968577
total janny death and so on and so forth
>>
File: what_is_it.png (54 KB, 759x421)
I am going to see if I can re-inject some images back into mochi. It says models on one side and samples on the other. It doesn't appear to be latents (as I have thrown VAE decode node on it and failed horribly). Has anyone tried this and got anywhere?
>>
File: Mochi_00002.webm (192 KB, 856x480)
>>
File: monroe.jpg (42 KB, 400x600)
is there a word for this? trying to hold dress down while wind is blowing it up
>>
>>102968799
You're gonna need a lora for that I think
>>
>>102968799
Billowing up/upwards
>>
File: 00531-1615470503.jpg (1.09 MB, 1260x1620)
>>102968694
It's so tiresome

>>102968799
Drop that image to interrogator, should be doable with just prompt
>>
File: 1708550557580330.png (467 KB, 399x399)
back from the three day ban after posting a "body horror" image (pic unrelated)
what did i miss?
>>
>>102968903
Sana and SD 3.5M waiting room.
>>
>>102968903
>467 KB PNG
>back from the three day ban after posting a "body horror" image
What the fuck?
>>
can you catbox the body horror i wanna see it
>>
File: Mochi_preview_00002.webm (726 KB, 856x480)
>>102968708
I guess I will jump into the code. I think just a few keyframes will clean this up.
>>
File: 00549-2887602201.jpg (402 KB, 1344x1728)
>>
>>102967728
>homemade pizza!
gonna need a finetune for that I'm afraid
>here you go
Thanks for sharing anon they look stylish. Good to see that maybe genmo just needs good prompts + luck because the quality didn't degrade very much at all going from 16 to 8 bit and from the website's model to the 480p one

>>102968542
Did you change the prompt for ages at all for this gen? This is close enough to the age I'm trying to get so maybe you cracked the code or maybe just got lucky
>>
File: 00175-1733960784.png (1.04 MB, 896x1152)
>>
>>102968937
so nothing new?
>>102968944
>>102926927
>>
>>102969048
didn't change it
>>
File: 2x_upscale_video_00003.webm (1.75 MB, 1712x960)
Prompt executed in 1286.83 seconds.
Q8, 65 steps, 73 frames
Ran upscaling afterwards
>>
File: 3723796886.png (1.01 MB, 1344x768)
>>
File: 00574-2887602198.jpg (349 KB, 1344x1728)
>>102969193
Looks very nice after rife interpolation
>>
File: 00579-2887602198.jpg (430 KB, 1344x1728)
>>
File: 00163-3004118374.jpg (850 KB, 1280x1720)
>>
File: lmao.png (862 KB, 1049x778)
I wonder how often that happens on civitai
>>
>>102969370
SHUT IT DOWN NOW
>>
>>102969370
BRO
>>
>>102969370
kek, close civitai
>>
>>102969370
HOLY SHIT AN AI GENERATED NAKED CHILD JUST FLEW OVER MY HOUSE
>>
Is it possible to use a diffusers model with ComfyUI? I trained a LoRA with SimpleTuner and it only works with the diffusers library...
>>
I've been genning some cute libtarts
>>
File: ComfyUI_Flux_14574.jpg (226 KB, 832x1216)
>>
File: ComfyUI_00850_.png (1.62 MB, 1280x1024)
>>
>>102969664
Very cool. I like the painted metal texture. From the neck down, at least. The head... not so much.
>>
Best prompt?
>>
>>102969140
>didn't change it
Understandable

>>102969193
Yeah this prompt is the best one so far. gonna work on variations of it when I have more time next week
>>
>>102969370
lmfao it's also really low quality and slimy. I remember whenever we had some pedo spamming in here they'd always be the worst gens you've ever seen even from a technical point of view. Either it was a case of debilitating mental handicap leading to both behaviors or it was bad actors who don't give a shit about AI image gen and just copied their settings from a civitai workflow.
>>
>>102969664
predicting with 99% confidence this one will be prominent in the collage
>>
>>
>>102969193
Try and run frame interpolation on it too with RIFE. The frames are all fake generated anyways so what is a few more?
https://github.com/hzwer/Practical-RIFE
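A minimal way to invoke it, assuming you cloned the repo into a Practical-RIFE directory (flag names follow the repo README at time of writing and may change between versions, so check --help; the input file name is just an example):

[code]
# Minimal sketch: 2x a clip's framerate with Practical-RIFE.
# Flags per the repo README; double-check them with
# `python3 inference_video.py --help` in your clone.
import subprocess

subprocess.run([
    "python3", "inference_video.py",
    "--exp=1",                # 2**1 = 2x interpolation (24fps -> 48fps)
    "--video=mochi_out.mp4",  # example input file name
], check=True, cwd="Practical-RIFE")
[/code]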
>>
>>102968876
lol, good gen. chainmail didn't quite work out though.

The inability of these models to do keyboards and chainmail and things of that nature seems like the next big limitation I'd be trying to solve if I were a researcher

>>102968526
>448x544
based, looks good
>>
>>102969994
Chainmail fail is user error. Testing merges with Asp2.
>>
>>102970103
ok true it usually looks better than that. Nonetheless regular repeating linked geometry like that isn't the strong suit of these models
>>
>240p upscaled to 480
Prompt executed in 261.31 seconds, 5x faster than 480p

You can kind of get something at low resolutions. 240p is probably too low though.
>>
>>102969994
Yeah just playing with Pixart
>>
>>102969994
Chainmail and similar things are logic problems, same reason why rooms are nonsensical. The AI needs reasoning to understand why things are the way they are, why light switches are placed where they are, for example. It's a complex problem.
>>
Starting to hit my monthly limit on some accounts
>>
File: ComfyUI_00856_.png (1.36 MB, 1280x1024)
>>
>>102970279
>some accounts
Hackermens!
>>
>>102970290
I see a futuristic plasma rifle in it
>>
>>102968708
Can Mochi be used as an image generator?
>>
File: Mochi_00006.webm (597 KB, 376x688)
Accidentally flipped the width and height, let it finish anyway, surprised it kind of worked.
>>
The sd3.5 branch of kohya appears to work for lora training. I mean, it's running, and the loss looks decent. Anybody know if you're supposed to set the weighting_scheme to something other than the default? There's a bunch of different options that affect both the timestep sampling and the loss weighting.
>>
>>102970444
flipper like that dolphin
>>
now that google colab is not collaborating anymore, is there any way to use free sd2.5? (creating 100000 fake accounts on their site kinda sucks)
>>
>>102970321
>Hackermens!
I was able to make more accounts thanks to my holiday SIM card
The retarded part is that it says 30/30 monthly gens when there's literally only 17 gens including moderated ones visible in my account
>>
File: Mochi_00009.webm (394 KB, 640x376)
Last one for now
>Prompt executed in 583.43 seconds
I think this resolution strikes a nice balance between speed and quality for my 16gb card. Will test more later to see if I just got a lucky seed
>>
>>102969965
Link to the custom node, but it only has support for up to 4.10
https://github.com/Fannovel16/ComfyUI-Frame-Interpolation
>>
>>102970629
Yeah I tried it, it's good for CogVideoX which is 8fps not much need for it with Mochi at 24fps
>>
>>102970477
>Anybody know if you're supposed to set the weighting_scheme to something other than the default?
Welcome to the bleeding edge, where you already know more about this than everyone else in the world.
And when you're here, you see what you get and experiment by changing the defaults and tell other people about it, and they will know, because you told them.
>>
>>102970648
24 FPS is enough to get stable output for 60 FPS video if that is what one wants. Not sure if the glitches would be amplified or not doing that.
>>
>>102970538
>is there any way to use free sd2.5?
If you mean sd3.5 have at it:
https://huggingface.co/spaces/Nymbo/Stable-Diffusion-3.5-Large-Serverless
>>
File: tmp24sm323i.png (1.16 MB, 1232x920)
>>
so is Sd 3.5 a noticeable improvement over XL apart from the ability to do text?
>>
>>102970849
T5 is much, much better for prompting.
>>
>>102970648
Pretty sure mochi is 30fps?

>>102970675
I don't see a reason why interpolation would amplify glitches, it just tries to figure out an intermediate frame. Maybe if you're doing abstract stuff or explosions/particle effects
I have no idea what I'm talking about though
>>
>>102970405
it only has 4 nodes. I am looking into cracking one open to get to the image gen piece.

short answer, no, it can not
>>
I have all the nodes. Assume the position
>>
Any video model can be used as an image generator, you just extract whatever frame you want from the MP4 with ffmpeg and save it
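For example (file names here are placeholders; the select filter keeps only the frame index you ask for):

[code]
# Pull frame N out of a gen as a still, per the ffmpeg tip above.
# Shell equivalent:
#   ffmpeg -i mochi_out.mp4 -vf "select=eq(n\,42)" -vframes 1 still.png
import subprocess

def extract_frame(video: str, frame: int, out: str) -> None:
    subprocess.run([
        "ffmpeg", "-i", video,
        "-vf", f"select=eq(n\\,{frame})",  # keep only frame number `frame`
        "-vframes", "1",                   # write a single image
        out,
    ], check=True)

extract_frame("mochi_out.mp4", 42, "still.png")
[/code]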
>>
File: download.png (116 KB, 750x690)
>>102970849
Is there a way to have a "general coloration", like this image is blue? I tried "color fog", but it doesn't work; with midjourney it was easy and worked quite nicely
>>
File: tmp6mqcli57.png (1.28 MB, 1152x896)
>>
It seems sd3 no longer recognizes "by author"... So sd1 is still king after all
>>
>>102971023
feed it a colored latent, lower your denoise slightly and maybe increase steps
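A minimal sketch of that trick in diffusers terms, assuming an SDXL-class checkpoint (model id, color, and strength are example values; in ComfyUI the equivalent is feeding a solid-color image through VAE Encode into the sampler with denoise below 1.0):

[code]
# "Colored latent" trick as plain img2img: a flat blue canvas as the
# init image, with strength < 1.0 so some of the tint survives denoising.
import torch
from diffusers import AutoPipelineForImage2Image
from PIL import Image

pipe = AutoPipelineForImage2Image.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")

blue_canvas = Image.new("RGB", (1024, 1024), (40, 70, 160))  # the "fog" color
image = pipe(
    prompt="portrait of a knight, cinematic lighting",
    image=blue_canvas,
    strength=0.9,  # lower = heavier color cast, less room for the prompt
    num_inference_steps=40,
).images[0]
image.save("blue_tinted.png")
[/code]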
>>
I'm getting extremely confused with training. I've trained around 8 character loras and only 3 of them are generating decent results. I think it might be an issue with my captioning. How do you guys caption datasets of people? I've read mixed results and tried a few but the end result is always subpar...
>>
>>102970988
Yeah, it does. I think it's because the workflow default goes to 24 FPS which is why it is confusing but it is quite clear from the blog post announcement.
https://www.genmo.ai/blog
>Motion Quality: Mochi 1 generates smooth videos at 30 frames per second
>>
>sd3.5_large.safetensors
>sd3.5_large-Q[458]_[01].gguf
>sd3.5_large_fp8_scaled.safetensors
>above ones with "_turbo"
which one should I pick?
>>
File: ComfyUI_Flux_14590.jpg (213 KB, 832x1216)
>>
nice
>>
very opposite opinion
>>
File: image (18).png (435 KB, 1024x1024)
>>
"model" seems to help keep them older than pubescent but my sample size is only 2
>young Russian teenage model girl

>>102971230
>Yeah, it does. I think it's because the workflow default goes to 24 FPS which is why it is confusing
Shitty defaults and their consequences have been a disaster for the generative AI community
>>
>>102971677
what model anon? I like the water effect
>>
>>102969889
Probably to ensure that it was unmissably AI.
>>
>>102971068
can't people just make loras for it?
>>
File: 00357-2335055315.png (963 KB, 896x1152)
I've been trying flux models for a while but I still don't get the difference between distilled CFG and CFG, what is each one for
>>
>>102971873
the other does the other thing
>>
>>102971068
They had pictures on their dataset with "by author".
They threw their tags to the trash.
They used a Vision Language Model to create new tags, and it doesn't know any authors.
I claim they did this because they are retarded and don't realize their value: they could have outdone midjourney, which is still king at this.
Clit Eastwood was preferable.
>>
File: 00358-3991456649.png (1.06 MB, 896x1152)
>>102971891
>>
>>102970849
The ability to tell it where you want the things on the picture instead of relying on noise looking for what you asked for.
>>
>>102971917
CFG does not exist in Dev because they distilled it and made it 1.0 the default and unchangeable.
Dedistill brings CFG back at the cost of everything else worth using Flux for.
>>
press button to continue
>>
File: 00360-2759304127.png (1.02 MB, 896x1152)
I'm trying out Verus Vision and holy fuck it's slow
>>
File: 306243625737261058.webm (1.35 MB, 1264x720)
wtf, I didn't type any prompt and it turned the fighter jet into a paper plane
>>
File: 00365-2397504837.png (991 KB, 1152x896)
the results are good but damn is it slow
>>
>>102968031
Maybe wikimedia commons? Upload them as public domain images?
>>
File: ships_1.webm (2.13 MB, 856x480)
>>102966229
the movement is so fucking good, and in my 100% expert anon opinion the problem's with the VAE, which is the part that turns the latents into the image, filling in the finer details and implementing them coherently.
I can also see the potential, in this gen you can see the cup of coffee, the coffee stains on the borders of the mug, the coffee moves realistically, but then the ships, the part that moves and is detailed, look like shit, that's why I think it's a problem with the temporal part of the VAE. I'm disgusted at how close to impressive these look. So close yet so far, how sad.
>>
>>102972222
It looks like they are predicted based on motion vectors kind of like DLSS, probably not the VAE
>>
File: 00361-2486161067.jpg (451 KB, 1280x1720)
>>
Man, not being able to start with an already existing image in mochi 1 makes for awful results.
And worst of all, you can only see that after 30-40min of inference.
>>
File: 00367-3805991773.png (886 KB, 1152x896)
>>
File: 00062-547987379.png (1.38 MB, 832x1216)
>>
>>102972254
that's a real drawing
>>
File: 00372-2471372132.png (880 KB, 896x1152)
>>
>>102972262
>30-40min
lucky, I wait 1.5h to get 121 frames at 80 steps
>>
>Even Verus Vision is like 35gb
Please tell me there's a way to run this shit on CPU. My poor 3060 can't take this...
>>
>>102972573
I wish it was worth it
>>
>>102972603
8 or 12GB?
>>
>>102972626
12gb.
>>
>>102972654
I'm also running Verus on 12GB, but with a 4070 tho. At least memorywise it's doable
>>
>>102972677
Oh, shit, really? Is it FP16 or FP8? How slow?
>>
>>102972573
Using a 3090@300W, I get 5s at 24fps after 40min.
>>
>>102972716
FP8, genning time is a bit more than a minute at 1152x896
>>
File: 2639166284.png (982 KB, 832x1216)
>>
>>102972759
Mmm, I see. I hear the details suffer pretty noticeable changes going from FP16 -> FP8, is that true here, too? I've seen some comparisons with other models, it does seem like a bit of a drop.
>>
>>102972817
I haven't tried out FP16 Verus and don't have that much experience with Flux yet so I can't tell really. But Verus seems to have fixed the plastic skin issue that plagues all other Flux models I've tried so far, but it's also super fucking slow compared to the ones I've tried before.
>>
>>102972262
>Man, not being able to start with an already existing image in mochi 1 makes for awful results.
Is it difficult to "patch" in image2video? I know that image2image from a text2image model is relatively simple since instead of just a random latent image full of noise you just start the gen with the image itself as the latent space. Why couldn't you just set the latent space of the first frame of the video model to an image in the same way?
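A sketch of that idea in pseudocode — none of these names are Mochi's actual API, and a real implementation would also have to handle the VAE's temporal compression (one latent frame covers several pixel frames) and proper noise scheduling instead of this crude blend:

[code]
# Pseudocode sketch of img2video-by-latent-seeding. NOT working Mochi
# code: the blend below is a crude stand-in for noising the init latent
# to the sampler's starting sigma, img2img style.
import torch

def img2video_init(image_latent: torch.Tensor, latent_shape, strength: float = 0.8):
    # latent_shape: (channels, latent_frames, height, width)
    noise = torch.randn(latent_shape)
    # Seed the first latent frame with the (VAE-encoded) image; the rest
    # stays pure noise for the model to fill in.
    noise[:, 0] = strength * noise[:, 0] + (1.0 - strength) * image_latent
    return noise
[/code]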
>>
>>102972988
>Why couldn't you just set the latent space of the first frame of the video model to an image in the same way?
Because nobody has produced code for this, of course it's doable, we need a coding genius to figure it out.
And this proves ChatGPT is a meme or you could ask it.
>>
>>102972603
What? Flux on CPU? Willing to wait 2 hours for a 512x512 image?
>>
>>102972039
Imagine it in reverse.
>>
>>102972025
Only 3 images and I already hate the guts of that cat.
>>
>>102971917
Cute!
>>
seems like 100 steps is a solution for a lot of mochi quality ....issues

https://github.com/kijai/ComfyUI-MochiWrapper/issues/21
>>
>>102973372
oh, I'll try it asap
>>
>>102973372
>200 steps makes it even better
there are no cope settings with this one huh?
>>
>>102973372
>>102973438
It's over...
>>
is it possible to get any dynamic pose with flux? i wanna do fight poses with it but i always get static standing poses.
>>
>>102973449
What? No, this is great, you can get the quality of the unreleased HD already at 480p, you just need to up the steps.
Imagine if you could get Flux quality on SD3.5 in 100 steps, people would abandon Flux in a heartbeat, but it reaches its limits in 40, maybe low steps video generation is just like Schnell, so it needs 100 steps to shine.
>>
>>102973495
Um, not really, try the Kolors model for that, add the text in postproduction.
>>
>>102973518
>45 minutes to generate a 4 second video on a 3090
>this is great
>>
>>102973518
I think we all fell into the trap that 50 steps was the optimal point and what the model was good at, well, at least i did.
We've been spoiled with SD having small parameter ranges; it seems, as you suggest, video models may require more iterations per frame.
>>
>>102973495
try repeating yourself with different words and using full sentences.
>>
>>102973550
It is what it is, I just batch them before going to sleep and that's it.
Until someone inevitably figures a way to accelerate that.
>>
>>102973495
The model was fed with shit captions around movement/pose etc, so it's mostly chance based.
>>
>>102973588
Text2video is just too random for it to be worth it, for me at least. If it was img2video that would be another story
>>
>>102973610
Oh I agree, I think the mochi devs said something about img2video, if they release an updated code with it, it'll be so much simpler to prompt it.
>>
>>102973550
>oh god, I wish this technology never existed and that there was no way for me to do this because now I have to wait 45 minutes.
>>
>>102973495
controlnet
>>
>>102973628
>if they release an updated code with it, it'll be so much simpler to prompt it
Neither img2img nor ControlNet were added by the people that released Stable Diffusion, they were added by the community; if instead of waiting for the mochi devs to implement it someone just did it, we'd already have it.
It's rocket science but we have rocket scientists.
>>
File: 1722630127970299.png (101 KB, 2015x675)
>>102973667
see picrel
>>
Is there a browser for the rereleased LAION dataset yet?
https://laion.ai/blog/relaion-5b/
>>
File: Mochi_00004.webm (1.15 MB, 864x488)
>Prompt executed in 1444.05 seconds
100 steps. Only 60 frames tho
16gb VRAM
>>
>>102973702
Interested in a Mochi stills generator. An anon said he's looking into how to do it.
>>
Man, seeing more big youtubers make anti-ai videos is so weird when AI capability is constantly improving. It's like two parallel worlds.

Godot reddit community just banned AI too.
>>
>>102973806
If reddit is against it, it's probably the truth.
>>
File: ComfyUI_02642_.png (1.31 MB, 1024x1024)
>>102973806
why are so many people against AI and getting so Butthurt over it?
did people also chimp out like that when Photoshop became a thing?
shouldnt people be happy that more tools exist now that can help one make great content?
>>
>>102973847
>why are so many people against AI and getting so Butthurt over it?
They're uninformed mostly.
>>
>>102973847
>why are so many people against AI
It's scary.
>>
>>102973847
>why are so many people against AI and getting so Butthurt over it?
Every other video is about moral panics around ai, no surprise the result is this + extreme way over the top hype that sold something that doesn't exist, aka perfect flawless ai.
And fucking openai that cannot stfu about selling their stuff like it's nuclear weaponry to increase the hype and get more investors.

>did people also chimp out like that when Photoshop became a thing?
Yes but it was way more subdued since social media weren't amplifying the thing left and right.

>shouldnt people be happy that more tools exist now that can help one make great content?
Obviously, but just look at how misinformed the average normie is around this.
What a normie sees every day are videos on how ai is evil, and another gazillion shitty ai made youtube videos/shorts with synthetic voices there to cash in easy money.
>>
File: ComfyUI_temp_vctya_00004_.jpg (851 KB, 2496x1924)
im finally satisfied with the state of a gigachad lora, just training the base illustrious version (i gen on noob 0.5 and trained first for it) and ill have a full post in an hour or so
>>
>>102973930
i was really quite pleased with how it stabilized dynamic poses, which made me put in extra effort to deliver a clean 1.0 version
>>
>>102973847
Yes people really were butthurt when people started using tablets to digitally paint. They used the exact same "you're not a real artist" argument, said to professional artists that were switching from easel to iPad.
>>
>>102973914
>What a normie sees every day are videos on how ai is evil, and another gazillion shitty ai made youtube videos/shorts with synthetic voices there to cash in easy money.
it makes perfect sense to see people holding the simultaneous idea that ai is super dangerous to everything while at the same time so useless no one will use it lol
>>
>>102973955
>Everything that was made before I was a teen is normal and part of life, everything invented after is dangerous and should be heavily controlled or banned.
Classic.
>>
>>102973978
I just think it's hilarious that these people write about ebil AI on the internet, the single most disruptive thing to ever exist that has cost millions of people their jobs.
>>
>>102973955
That's different since it's direct input vs directing. You aren't the artist, the AI is, you are merely trying to guide it to get the output. Using AI to generate images is more like you commissioning an artist, only that the artist is an AI in that case.
>>
smells like 2023 in here
>>
>>102974004
that was my fart
>>
>>102973995
Distinction without a difference, same arguments every time, I don't care after that. The fact is digital painting made art more accessible which means less commissions and less art supplies being bought thus costing people jobs. It's just a removal of a huge barrier to entry because traditional media costs a shit ton of money thus if any retard can buy a $200 tablet and paint it threatens the old guard. The end.
>>
>>102974030
There are literally professional artists working right now that have never used traditional media. Let that sink in. Just as there will be AI artists that will work professionally that have never picked up a digital pen. Don't seethe because people have removed the need to use a crude tool to draw a picture.
>>
I literally am an artist tho
>>
>>102974030
Current AI generators don't allow the translation of ideas and the level of fine tuning that a skilled artist has by simply drawing by hand. As long as AI does not allow the same amount of fine control it will not be a replacement for an actual hands-on creative process. Once we have art generators that allow this level of fine tuning in real time it will be superior/equal to hand drawn art, but the current descriptive approach will not be able to do that.
>>
>>102974083
If I generate 1000 images refining a prompt until I get something I like, I'm an artist because I ultimately made the final picture. Feel free to seethe all day about it.
>>
>>102974022
open a window the smell is getting worse
>>
>>102974097
I do both genning and drawing and they are different things, and genning does not allow the amount of creative expression that drawing by hand does. It's its own thing, but as of now it's closer to photography than drawing and painting, as in taking a rough concept and then rearranging it until you get something you like.
>>
>>102974122
As someone with an art degree and have done multiple paintings, I completely disagree. And as the AI gets more complex in the ability to take direction and refine a picture through words or annotations, even that point won't be valid. But it doesn't matter, we've already established that randomness is art. If you throw paint randomly until you have a final composition that you like, that is art. No different than hitting the generate button.
>>
File: u4874.png (154 KB, 860x997)
>>102974154
>As someone with an art degree and have done multiple paintings,
>>
>>102974097
>I downloaded a book I'm an author
>>
Trying that 100 steps for a video idea, my gpu is a space heater constantly spewing out 300W, thankfully it's becoming cold so it's kind of useful.
>>
is there any recommended lora for flux?
>>
did someone piss of /ic/ or something
>>
>>102974185
>I sprayed diarrhea from my asshole on a canvas. I'm an artist.
>>
>>102974199
I had marginal success on 1 test, it's better but my scene was inherently blurry due to the prompt, it's late here and i've gone stupid.
I will have a crack at it tomorrow if someone hasn't coded img2vid by then lol
>>
File: neverever.png (14 KB, 457x121)
>>102973702
Let me show you picrel from here:
https://huggingface.co/playgroundai/playground-v2.5-1024px-aesthetic
That was promised on February 16th, and we are still waiting.
Don't trust anyone when they talk about their future plans, eat your chocolate now because the scientist that promised you two pieces may never come back.
People should assume they'll never provide image-to-video for mochi and do it themselves, or we could still be asking where it is 8 months from now.
>>
>>102974241
>someone hasn't coded img2vid by then lol
sometimes autism is surprising so who knows
>>
>>102974206
I strongly recommend an artstyle lora (just download a bunch and pick your favorite) because Flux sucks at style. Also loras decrease generation speed quite a lot so you might want to instead merge it or use someone's checkpoint.

The Turbo Alpha lora is good for speeding up your gens while you refine your prompt.
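For the merge route, if you're on diffusers rather than a UI, fusing is one way to dodge the lora slowdown (a sketch; the lora path is a placeholder and bf16 Flux wants a beefy card):

[code]
# Bake a style lora into the base weights so there's no per-step lora
# overhead at gen time. Model id is Flux dev; the lora path is made up.
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")
pipe.load_lora_weights("path/to/artstyle_lora.safetensors")
pipe.fuse_lora(lora_scale=0.9)  # merge at the chosen strength
pipe.unload_lora_weights()      # drop the now-redundant lora modules
[/code]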
>>
>>102973847
>did people also chimp out like that when Photoshop became a thing?
There's the concept of hard work, that anybody deserves what they have if they worked hard for it; learning how to use Photoshop requires skill so it's respected, someone outdoing a skilled photoshopper by clicking a button will not be.
The same happened when photography was invented, because people assumed making a great photo didn't require hard work.
>>
>>102974266
thanks anon
>>
>>102973761
local video models should be focused on img2video and deliver quality at smaller resolutions to save computing power, that is the only way to compete against cloud services


>>102973847
AI feels bigger than photoshop, the photographic camera, any new technology so far
>>
>>102974244
sure but I still hope they deliver lol
either way I'll be happy
>>
>>102974122
>genning does not allow the amount of creative expression that drawing by hand does
It does, you're just not using the tools right.
>I can't figure how to express my ideas creatively with AI generation, so nobody else can, so nobody does
>>
>>102974266
>The Turbo Alpha lora is good for speeding up your gens while you refine your prompt.
I should've added - it's good for speed but it does reduce quality a little.
So it's for writing your prompt, but not for finalization.
>>
>>102974257
If it's not there tomorrow morning EU hours i'm going to frustrate myself using a new install and env and gpt4o and try to lolzcode it. I've not coded anything since 68000 assembly. so like, don't depend on me.
>>
>>102974308
How much faster is it anon? Compared to just going with fewer steps and a smaller resolution?
>>
>>102974288
>AI feels bigger than photoshop, the photographic camera, any new technology so far
Only for people that spent decades of their life mastering their craft only to see other people outdoing them with some text and a click.
This isn't really bigger than the mass production of chairs: you spent so much time learning how to make the most comfortable one and now nobody will buy it because the factory made 100 of them at 1/10 of the price, and people don't know the difference because they haven't sat in yours, and even then, they'd rather save the money to get something else, so you wasted your life.
It's just that most people didn't live in the industrial mass production revolution, so this feels larger than life.
>>
>>102974221
True anon if you put it this way then yes you're also an artist
>>
>>102974313
well good luck, someone having fun with this will hopefully figure it out, the model without img2video is kind of sad after all
>>
>>102974221
Using ai is like being a manager. It's not easy, but it's not really art. Only metaphorically.
>>
>>102974397
It can be art, but you're closer to a movie director than a graphical artist. I don't get why some people are so adamant about pitting it against handdrawn art, despite some outputs looking similar they're entirely different processes
>>
goddamn you are fucking retarded anon
>>
>>102974475
gonna cry?
>>
is brap posting still a thing?
>>
>>102974495
Brapposting is eternal
>>
>>102974397
You can use AI to enhance what you created manually or as an assistant.
>>
>>102974505
promptlet
>>
File: NotArt.png (1.92 MB, 1360x768)
>>102974457
I don't get why people care so much about being considered artists, hampart is a kind of art done exclusively for profits, the whole NFT Monkeys fiasco was the peak of it, so not all art is good and not all artists are good so it's not necessarily a good term to be associated with.
I spent hours working on picrel, it's not art? Good! Who cares about that.
>>
>>102974530
i've always seen it just as gacha rolling for cool looking pictures.
>>
It doesn't matter what someone else thinks. All that matters is how you feel on the inside. How do you feel on the inside, anon?
>>
>>102974515
???
>>
>>102974550
>felt recoil
>recoilless
>soulless space shooter
>>
File: 2024-10-25_00021_.png (982 KB, 720x1280)
>>
>>102973942
Not bad
>>
>>102974530
As said it can be art, but it's not hand painted art, just as it is not a photograph or a theater play
>>
>>102974530
I mean, I could doodle, paint, produce music, etc. before AI so when I tell someone that (in real life) they usually stop saying "you're not an artist" since I mainly use genai now.
>>
>>102974606
until the computer can make pretty pretty pictures without needing a human to prompt it the human is the artist simple as
>>
>>102974606
art really requires an artist, and ai art is artistless. therefore ai art is artless. They are images. This distinction is important to understand, but while philosophically accurate, what I said won't shape the vernacular, people will say ai artist and ai art, though neither exist.
>>
>>102974642
I used an LLM to generate this prompt:
>>102974576

>... a still life featuring a wicker basket overflowing with a variety of juicy, ripe apples, set against a warm, golden background that evokes a sense of autumnal coziness. The apples should be the main focus of the image, with some of them spilling out of the basket and onto the surrounding surface. Incorporate some subtle, natural textures and shading to give the image a realistic, tactile feel. The overall mood of the image should be inviting and appetizing, making the viewer want to reach out and grab an apple.
>>
>>102974642
When you commission someone to draw a picture for you, are you then the artist?
>>
>>102974656
you still had to give the llm instructions, no matter how vague
the image didn't arise out of the ether
>>
File: 2024-10-25_00023_.png (1.01 MB, 720x1280)
>>102974576
>>
>>102974664
it can't be like commissioning a human if the computer is not sentient. again, we had this discourse over a year ago once AI started to actually look like what it was trained on.

if you say "prompting is just commissioning" then you must also say "the computer is just a human"
>>
>>102974697
>muh sentience
meaningless word. you can absolutely make prompting an ai be like commissioning a human, without making the ai sentient.
>>
>>102974697
The computer isn't a human of course, but it is the agent that creates the image. It fulfills the same role as a human artist here, it takes your instructions to turn them into an image. You are not involved in the creation of the image besides giving instructions. So, if you aren't the artist when commissioning a painting why would you suddenly be the artist when commissioning it from an AI?
>>
>>102974643
Nothing you said is accurate. The prompter is the artist.
>>
>>102974753
>you are not involved in the creation of the image besides being involved in the creation of the image
Your concession has been accepted
>>
>>102974737
>>102974753
i dont want to accuse you of not interacting with AI at the same level as myself and others ITT but i am struggling to come up with another explanation for this phenomenon
if you are under the impression that the only ways to manipulate latent space are through words then you have much to catch up on
in the end, >>102974550 is correct and none of this matters. if one person says they're an artist when they use it and another says they themselves are not - neither can impose that onto the other.
just gen, anon
>>
>>102974762
So someone who commissions a painter to draw a painting for them is the artist who painted the image?
>>
New

>>102974813
>>102974813
>>102974813
>>
>>102974796
Did you accidentally include my post in there? You did not bolster your use of the concept of sentience.
>>
>>102974803
A traditional commissioner? No
A commissioner who is sitting right there the entire time directing the painter? Yes, how is this even a question.

Are directors not artists?
>>
>>102974844
>>you can absolutely make prompting an ai be like commissioning a human, without making the ai sentient.
okay, so do it, tell me how
>>
>>102974869
>Technology will never be like that because I can't imagine it
And other things people have said throughout history, only to be proven wrong every time.
>>
>>102974666
No, you don't. It can generate text based on the random seed.
>>
>>102974886
And monkeys can write shakespeare. What the fuck is your point?

You do realize that the technology is irrelevant to the fact that it does in fact translate human intent into something else?
>>
>>102974876
never said that it wont be in the future. i said it's clear that it's not right now. so answer my question
>>102974886
fair, however i'd argue that your choice of LLM, image model, etc still means you are the ultimate decider. i'll only agree with you once we have literal robots who are functioning members of society
>>
>>102974918
>so answer my question
Ask one
>durr how do am thing work
is not a question its a piss poor rhetorical
>>
>>102974935
>Ask one
>>102974869
>>you can absolutely make prompting an ai be like commissioning a human, without making the ai sentient.
>okay, so do it, tell me how

im beginning to think you're being disingenuous, anon
>>
>>102974943
retard lmao
>>
>>102974918
And what if humans cease to be functioning members of robot society?
>>
>>102975103
come back when you learn how to do more than just prompt
>>102975115
good question
>>
Can Flux do actual recognizable people yet? I tried like two weeks ago and it couldn't even get a celebrity's hair color right. Sure as fuck knows what Geralt looks like though.

I'm currently convinced they nuked all popular celebrities from their training sets to dodge controversy.
>>
>>102975136
reading comprehension
>>
File: image (1).png (518 KB, 1024x1024)