[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: the longest dick general.jpg (2.93 MB, 2433x3264)
2.93 MB
2.93 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102934088

Sigmas Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3 Large
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large

>Sana
https://github.com/NVlabs/Sana
https://8876bd28ee2da4b909.gradio.live

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
File: file.webm (1005 KB, 848x480)
1005 KB
1005 KB WEBM
https://github.com/kijai/ComfyUI-MochiWrapper
>Hatsune Miku skateboarding in New York
lawl, with the actual settings I can't go further than 19frames (that takes me 20 gb of vram), but it's fast to render though (19 frames, 50 steps, 4.72s/it, 03:55m), I'm sure it can be optimized, this is what he said on his Github
>so far highest I've done is 97 with the default tile size 2x2 grid.
>>
couldn't think of an idea to gen last night so I just did more Indian girls. will post some when I get to my computer
>>
https://www.youtube.com/watch?v=chqcGWym5d0

More Eve
>>
File: its_over_were_back.png (910 KB, 1144x1186)
910 KB
910 KB PNG
>finally get official Mochi scripts running on 4x4090
>have to tweak the code in various places, delete T5 and DiT after they're used so it doesn't OOM
>still can't run at full frame length because the VAE uses so much memory
>but, it works, and uses all GPUs in parallel to gen relatively fast
>try coomer prompt
>https://files.catbox.moe/t6276z.mp4
B-bros?
>>
>>102941085
did you actually buy 4 one thousand dollar graphics cards to generic a short noisy gif of some bouncy boobies?
>>
>>102941085
you can go for that one anon
https://github.com/kijai/ComfyUI-MochiWrapper

Nice video btw :^), what coomer prompt you used?
>>
File: bComfyUI_132642_.jpg (2.11 MB, 3072x1536)
2.11 MB
2.11 MB JPG
>>102941110
no he made a piss lora too if it's the same guy i think it is
>>
>>102941069
Can you post that image that is shown at 11 seconds?
>>
>>102941110
he's thinking long term anon, we'll get the HD version soon
https://www.genmo.ai/blog
>>Today, we are releasing our 480p base model, with Mochi 1 HD coming later this year.
>>
>>102941110
No, I built the machine mainly to finetune LLMs, several of which I've published on HF and gotten fairly popular. I'm fairly rich, it wasn't a huge cost to me, and would have been worth it even if it cost double what it did.
>>102941115
"A young brunette woman stands in a bedroom, facing the camera. She's wearing jean shorts and is topless, her breasts and nipples exposed. She's swaying and dancing seductively."
>>102941123
That's another anon. I think there's a few with 4+ 3090s / 4090s (at least in /lmg/ there is).
>>
>>102941085
amazing, if mochi had img2video that would be the best really
>>
>>102940995
Something is strange with your OS/env/setup.

80% Vram (idle is about 7-10%) on 16GB for 49 frames with clip on the "cpu" @33% of 64GB Ram during Sampler phase, and vae tiling decode=on is at 45% Vram during decode.
>>
>>102941014
do druggies
>>
>>102940941
that chubby reimu!!! aaaaaa!!! mnhooooooooohhhhhhhh!!!!!!!!
>>
>>102941202
>That's another anon
oh yeah i think he had 4x3090s. cool stuff though, do you have anymore vids?
>>
File: file.png (185 KB, 2794x1075)
185 KB
185 KB PNG
https://github.com/kijai/ComfyUI-MochiWrapper/blob/main/examples/mochi_test_163_frames_01.json
can't use "enamble_vae_tilting" I got this error
>>
>>102941375
https://files.catbox.moe/wprx0v.mp4
Tried to modify the prompt to get her to take off her shirt, seems to struggle a bit, or maybe I need more gacha rolls. Even with the combined power of 4 4090s it takes a while for each gen.
>>
>>102941085
>https://files.catbox.moe/t6276z.mp4
Okay, I expected that local quality by the end of 2025.
It's actually impressive, I now find things impressive depending on how long I was expecting them to take.
"one year earlier" and it could have fooled me into thinking it was a real video.
>>
>>102941202
>I've published on HF and gotten fairly popular.
Cool. Some rp models?
>>
>>102941243
Open source, even my kitty could make it do img2video but she's sleeping and gets grumpy if she doesn't have her full sleep day.
>>
>>102941486
kek good shit, how long does it take you to gen with 4x4090s?
>>
>>102941506
>>https://files.catbox.moe/t6276z.mp4
that legit looks like a 80's porn, insane we can run this shit on our computer now
>>
>>102941202
>I'm fairly rich
Aren't you that anon that could give us Local Dalle 3 if he wanted but, nah, we're not entitled and stuff like that?
>>
File: file.png (16 KB, 2782x96)
16 KB
16 KB PNG
>>102941554
nta but I'm trying 60 fps + 50 steps on my 3090 and it's taking a bit more than 10 mn
>>
>>102941579
>60 fps
*60 frames
gaddamit
>>
>>102941554
It's 10s per step, plus all the overhead of starting up, loading the models, and the VAE at the end.
>>102941569
lol i'm not that rich. I make a bit over 200k, so a 10k machine for my hobbies is reasonable.
>>
>>102941085
tried your prompt, it was not pretty, but it could be a scene from a low budget horror.
If you've looked at the comfy wrapper can you suggest some settings?
https://files.catbox.moe/oa6nip.webm
>>
>>102941302
that's a good idea, but what kind? much to think about
>>
File: file.webm (321 KB, 856x480)
321 KB
321 KB WEBM
>>102941579
>nta but I'm trying 60 fps + 50 steps on my 3090 and it's taking a bit more than 10 mn
and this is the result kek
>>
>>102941694
I've gotten gens like that, especially with lower step count like 30. Try using 64 steps (the default from the official scripts). There may also be a large RNG component, I haven't done enough gens to get a feel for how consistent it is. Also I used 103 frames. I think the lower the frames the more it fucks up because it was probably trained at the full length.
>>
>>102941694
>>102941751
yeah the quality isn't close either, maybe that's because we're using fp8?
>>
>>102941761
Thanks. Food for thought.
>>102941765
Same, will trying tweaking it on shorter clips, see if the coherence improves.
>>
File: file.png (240 KB, 2608x753)
240 KB
240 KB PNG
>>102941694
>tried your prompt, it was not pretty, but it could be a scene from a low budget horror.
what settings did you use anon? I got this warning maybe it fucks up the video or something?
>>
>>102941700
meth chicks absolutely
>>
File: file.png (16 KB, 1447x91)
16 KB
16 KB PNG
>>102941426
>can't use "enamble_vae_tilting" I got this error
https://github.com/kijai/ComfyUI-MochiWrapper/issues/5#issuecomment-2432777262
Looks like the frame_batch_size must be inferior to 1/6th of num_frames to work
>>
>>102941085
this shit will be goated once we'll be able to do some image2video with it
>>
>>102941761
>Also I used 103 frames. I think the lower the frames the more it fucks up because it was probably trained at the full length.
what's their full length?
>>
>>102941761
>Also I used 103 frames. I think the lower the frames the more it fucks up because it was probably trained at the full length.
I don't think that's the case, his video >>102941085 is 30fps + 3 sec = 90 frames and the quality is amazing, something's wrong with our settings
>>
>>102941827
didn't have any errors anon.
>>
Is this the blessed thread?
>>
>>102941975
increase your frame_batch_size, I noticed that the lower the value, the shittier the quality gets
>>
Very blessed
>>
>>102941990
Yes, thank you.
>>
>>102942024
I suggest you to use his workflow with his values, I have touched them and they work fine
https://github.com/kijai/ComfyUI-MochiWrapper/blob/main/examples/mochi_test_163_frames_01.json
>>
>>102942043
What card do you have?
>>
File: file.png (1.45 MB, 800x1312)
1.45 MB
1.45 MB PNG
>>
>>102942113
H100
>>
>>102942113
a 3090, going for 60 frames + frame_batch_size = 10 "only" asks for 11gb of vram >>102941579
>>
>>102941085
>>102941202
>A young brunette woman stands in a bedroom, facing the camera. She's wearing jean shorts and is topless, her breasts and nipples exposed. She's swaying and dancing seductively.
That looks good, but why does it looks like a old ass video tape? when you see the video they displayed during the release, the images are way sharper, maybe it has to do with the fact they're using the HD models and we're not?
>>
File: file.png (1.59 MB, 800x1312)
1.59 MB
1.59 MB PNG
>>
File: file.png (400 KB, 1024x1024)
400 KB
400 KB PNG
>>
File: file.png (305 KB, 544x448)
305 KB
305 KB PNG
>>
Driving myself insane trying to install flash_attn
fuckin FORGET IT!
>>
>>102942117
Looks like mid-2000s DA, in a good way.
>>
>>102942617
install torch 2.3.1 + cu121

install the cu121 + torch 2.3.1 binary
https://github.com/bdashore3/flash-attention/releases/

profit
>>
>>102941069
based
>>
>>102942676
>install the cu121 + torch 2.3.1 binary
I meant the cu123 + torch 2.3.1 binary, even if it's 123 it'll work anyway
>>
>>102942043
97 frames seems to be the max i can get in a16gb card with 64 steps atm, an earlier run did seem more coherent using the newer .json
I've changed the seed on this run and will see how it turns out.
>>102942617
pip install flash-attn ??
It should know which versions of cuda, torch etc you have and pick one which fits.
I had to use chatgpt to walk me through *shrug*
>>
>>102942733
>pip install flash-attn ??
no it doesn't work that easily, making binaries on windows is hell on earth, you always have errors, I'm just using binaries from someone else kek >>102942676
>>
>>102941840
I wasted the last 20 minutes doing potheads and they just look like slightly sloppier normal people
>>
>>102941085
This will make jeets go buuurrrr
>>
>>
Is anyone here using their models as an extension to traditional digital drawing/painting?
I've tried the krita integration and it was quite the dissappointment. It could enhance the lineart for eyes and hair a bit, but needing to remove the background on every gen took away any productivity boost.
>>
File: file.png (699 KB, 627x736)
699 KB
699 KB PNG
>>102942822
Trump Trump Awooo owo
>>
>>102942845
yes
>https://github.com/centuryglass/IntraPaint
>>
>>102942232
how do you make it draw a minimalistic pen sketch like that?
what prompt do you use?
>>
File: ComfyUI_04408_.png (1.68 MB, 1280x1024)
1.68 MB
1.68 MB PNG
Sometimes you have to go back to Pony and face the things you've lost
>>
out of 1500 Indian broads I genned last night I'll probably only post a small handful. Not feeling it this time.
>>
>>102942970
finally a man with taste
>>
>>102942980
either give them armpit hair or kill yourself
>>
File: file.png (684 KB, 1024x1024)
684 KB
684 KB PNG
>>102942961
i dont have the full prompt anymore but it's just Sana with "sketch" as the main style token and negatives like "render, painting, illustration, photograph"
CFG set to maximum and PAG set to minimum
>>
>>102942845
Yeah, my mistake was trying to do 90% of the job and make AI give it a finish.
No, you let it do 90% of the job and finish it yourself, this may mean the composition isn't what you had in mind originally, this may mean your involvement with the pic was minimal and someone else could have done what you did.
But the point is the end result and turning 8 hours of work into 1, or ending with a pic you couldn't have done without AI at all.
>>
File: file.png (901 KB, 1024x1024)
901 KB
901 KB PNG
these are "sharpie sketch..."
>>
File: file.png (429 KB, 608x416)
429 KB
429 KB PNG
>>
File: file.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
>>
Do you people take requests?
I want to create an image but I don't have the necessary hardware.
>>
File: file.png (407 KB, 608x416)
407 KB
407 KB PNG
>>
>>102943132
Subscribe to my Patreon
>>102943138
Nice
>>
>>102943132
no. Go to the requests board or a topic specific board like /d/,/aco/ or /trash/
>>
File: file.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
>>
File: file.png (444 KB, 608x416)
444 KB
444 KB PNG
>>
>>102943167
It's not NSFW. It's political and funny.
>>
>>102943183
>political and funny
fuck off.
>>
Just catching up. Is that Mochi model supported in Comfy? Q8 GGUF anywhere?
>>
>>102943188
Dick
>>
File: file.png (419 KB, 608x416)
419 KB
419 KB PNG
>>
File: file.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>102943047
thank you fren
>>
>>102943199
>Is that Mochi model supported in Comfy?
yes
https://github.com/kijai/ComfyUI-MochiWrapper
>Q8 GGUF anywhere?
Not yet but it'll probably happen soon
>>
>>102943199
I am attempting this >>102941426 right now. HF is screwing me around. Fingers crossed anyways.
>>
File: 00006-1114943620.png (2.05 MB, 1248x1824)
2.05 MB
2.05 MB PNG
>>
File: file.png (637 KB, 608x416)
637 KB
637 KB PNG
>>
>>102942911
Thanks, I'll check it out.

>>102943072
That would suck. This stuff is for a game, so I need to have seperate layers for stuff like hair and eyes, and I need the assets in 4k, which is dog slow on my machine. I'd rather not spend more time training a lora for a custom style, I suck at that.
>>
File: 00012-3929263892.png (2.08 MB, 1248x1824)
2.08 MB
2.08 MB PNG
>>
>>102943250
>>102943256
Nice.
I wonder when we'll get first frame + last frame control. That's gonna be huge.
>>
>>102943132
>>102943183
whats your request?
I'll do it
>>
File: 00039-3579940396.png (2.79 MB, 1280x1920)
2.79 MB
2.79 MB PNG
>>
>>102943340
Whats up
>>
File: file.png (250 KB, 608x416)
250 KB
250 KB PNG
>>
>>102943293
really nice. prompt?
>>
Day 2 of trying to figure out Russian cyberpunk teens on genmo
>>
File: file.png (415 KB, 608x416)
415 KB
415 KB PNG
>>102943384
>eerie, canvas that looks like something familiar, maybe a person at a party, liminal spaces
>>
>>102943401
are you using the ComfyUi node anon? and if yes what are your settings?
>>
File: file.jpg (40 KB, 512x512)
40 KB
40 KB JPG
>>102943327
This image but replace the girl with Nirmala Sitharaman and replace the boy behind with Narendra Modi
>>
File: file.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
I swear they're updating the Sana model on the demo.
>>
Explicit age prompts i.e.. "1x year old girl" go through but get moderated immediately. Looks like refunds work though. "tween" helps make them younger but using it with "young" always gets you kids
>>
>>102943447
>Nirmala Sitharaman
I doubt any Human or AI even knows wtf this shit is supposed to be.
>>
>>102943448
Gotta check it
>>
>>102943469
Finance Minister of India
>>
>>102943447
What the fuck
>>
>>102943449
woah, is this the power of Mochi?
what do I need to run this shit?
>>
>>102943490
I swear people ask the most time intensive retarded requests
You'd get 90% of the result and maximum meme value just copy pasting their pictures in Photoshop
>>
>>102943439
I am not, I'm using the website with many Google accounts until I feel like I can wrangle the AI enough to maybe pay for the 8 dollar tier

>>102943504
The power of mochi is probably even better if you can run it locally because no censorship endpoints. You need 24 gigs of ram for 2.5 seconds and 48 gigs for full 5 seconds absolute minimum currently. Wait 2 (more) weeks for quants to start coming out
>>
File: file.png (853 KB, 1024x1024)
853 KB
853 KB PNG
>>102943448
how so? it does go in and out of service ive noticed
>>
>>102943447
saars please kindly do the needful and help this gentleman produce the image, many thanks and best regards
>>
>>102943524
>You need 24 gigs of ram for 2.5 seconds
I got a 4090 so I could do it but 2.5 seconds doesnt really sound all that much.
also how come video length is connected to vram usage?
>>
File: file.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>102943544
Maybe I'm in a better mood the quality of the outputs feel better
>>
File: 00009-2323832173_cleanup.png (1.85 MB, 1024x1536)
1.85 MB
1.85 MB PNG
>>
File: sana.jpg (248 KB, 1024x1024)
248 KB
248 KB JPG
>>102943448
no changes for me
>>
File: file.png (521 KB, 1024x1024)
521 KB
521 KB PNG
>>102943411
ty anon
>>
>>102943256
I am stuck where the anon who posted the original was stuck.

flash_attn==1.0.5 seems install for torch version 12.1. I am going to upgrade pytorch and "hopefully" not break other things.
>>
File: 00010-385831649_cleanup.png (1.83 MB, 1024x1536)
1.83 MB
1.83 MB PNG
>>
File: file.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
>>102943520
AI functions doesn't work on cracked versions and you still can't use AI to edit political personalities even with the official version.
>>
>>102943633
Reread what I said, I didn't say to use AI.
1) your request is a retarded time waster
2) it's not even effective and you would get 90% of the result you want in MS Paint pasting their faces over that original image
>>
File: file.png (541 KB, 1024x1024)
541 KB
541 KB PNG
spooky guy
>>
Anyone tried out the BF16 model?
https://huggingface.co/Kijai/Mochi_preview_comfy/blob/main/mochi_preview_dit_bf16.safetensors
>>
>>102943556
it stores all the images (frames) in vram. Can be coherent if you don't know where you came from.
>>
File: file.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>
File: file.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
CFG 1.1
>>
>>102943657
I'm spoop
>>
File: file.png (445 KB, 608x416)
445 KB
445 KB PNG
>>
>>102943562
Catbox?
>>
>>102943656
I don't want that. I want that anime like aesthetic to be superimposed on them.
>>
>>102943671
So basically local AI videos will never be a thing because the VRAM will always cripple us?
>>
>>102943183
Why don't you do it yourself in a space like this?
https://huggingface.co/spaces/FilipeR/FLUX.1-dev-UI
The only complain is people running out of quotas, but all you need to do is wait for them to replenish.
>>
File: file.png (926 KB, 736x1248)
926 KB
926 KB PNG
>>
>>102943777
I don't care, I'm telling you why no one will help you, fuck off.
>>
>>102943780
People bought VCRs for thousands of dollars. If you think people won't buy magic AI boxes for $10k if it meant they can make videos locally you're in for a surprise.
>>
>>102941123
>piss lora
Link so I can, uh, laugh at how ridiculous that anon is?
>>
>>102943665
I don't think that's possible to run, the fp8 already asks for more than 12gb of vram during inference, so bf16 would overflow the 3090/4090 cards, I'm waiting for Q8_0 personally
>>
File: file.png (129 KB, 1458x644)
129 KB
129 KB PNG
>>102943556
>I got a 4090 so I could do it but 2.5 seconds doesnt really sound all that much.
you can go much longer than that, I went for 8 sec with vae tilt
>>
>>102943302
Oh, right, I've always had the freedom of choosing the style, it would suck if the style already existed and I had to copy it.
Ironically, the best things I did like that were copying it by hand by drawing with my mouse, but it was always drawings by people done in the same way or with a tablet, not the style of game graphics.
I feel that all my drawing skills became obsolete after AI appeared, though.
>>
File: 00015-3119345800.png (1.91 MB, 1024x1536)
1.91 MB
1.91 MB PNG
>>102943751
https://files.catbox.moe/kwfk4y.png
>>
>>102943449
I remember a problem with flux was its complete inability to draw a person between the ages 9 to 18, is this something SD3.5 improves upon? I haven't seen people posting teenage girls.
>>
>>102943834
>you can go much longer than that, I went for 8 sec with vae tilt
pls elaborate
>>
>>102943524
This video would be much better in reverse.
>>
File: file.png (575 KB, 608x416)
575 KB
575 KB PNG
>>
>>102943888
I don't know much what to say, I gave you the settings, you activate the vae tilt, and you'll be able to run long videos onto your 24gb card, as you can see on the picture I went for num_frames = 193 so that's 8 sec (24fps)
>>
File: file.png (407 KB, 608x416)
407 KB
407 KB PNG
>>
>>102943817
he never shared it from what i remember
>>
>>102943777
Gotcha, you don't have any idea about how any of this works, and think it's something simple to do.
What you're requesting would take 15 minutes of my time, it's not a hardware thing where you send some text and your pic and get what you want automatically.
Even if you had the hardware to do it, you probably wouldn't know where to start because you have no idea what it conveys, so just learning what you'd need to do would take a couple hours (not setting it up, assume you had set it up, what's the next step?)
>>
>>102943818
I can't even get the mochimodel loader to show it, it's in the same directory, restarted, refreshed etc, shows the fp8 one fine.
I hope he didn't hardcode the model name -_-
>>
>>102943943
Do you remember if it was any better than other piss loras? Or why would one prefer his to others?
>>
>>102943817
You're in luck, I lurk these threads. Here you go senpai: https://civitai.com/models/779865/flux-female-peeing-wetting-desperation
Check my civit profile for even more piss if that's your thing.
Pro tip: use it with flux dev de-distill, it works even better.
>>
>>102943876
>I remember a problem with flux was its complete inability to draw a person between the ages 9 to 18
It could do those ages, just nowhere near consistently enough.
>is this something SD3.5 improves upon?
I haven't used it but almost certainly not.
>I haven't seen people posting teenage girls.
If I wasn't so busy for the next week you'd probably have seen more by now since I need at least a minute of futuristic ultraviolet Russian lust-provoking-adjacent video


>>102943904
I'm not even prompting for them to be so close and almost kissing, just prompting for them looking at each other and then the camera. It's the data the model was trained on making it suggestive. Not that I have a problem with it because I like the implication (when the gen is actually in the right age range)
>Two beautiful young Russian tween girls with long brown hair, dressed in futuristic angled white plastic outfits, sitting together in the back of a modern car. The scene is bathed in vibrant ultraviolet neon lights as they look at each other and then look at the camera intently. Slow motion. High resolution 4k

If I need to fundamentally reroll 10+ times to get a workable gen I can't justify the price genmo is asking for their cloud shit though
>>
>>102940941
Using ForgeUI, is there an easy way to see or store the trigger words of each Lora?
>>
>>102944006
no clue man, he didn't post any pics of it i don't think. this was back in august so i don't remember much of it besides him saying he had thousands of piss pictures he used to train it. prob could search the archives if you wanted.
>>
File: poopoo.png (66 KB, 360x561)
66 KB
66 KB PNG
>>102944047
hammer & wrench symbol? I sometimes include trigger word in the lora name itself so it's easy to copy+paste
>>
>>102943914
okay man thanks I'll try it out
>>
File: file.png (489 KB, 608x416)
489 KB
489 KB PNG
>>
>>
File: file.png (474 KB, 736x1248)
474 KB
474 KB PNG
>>
>>
>>102944127
:3
>>
>>
>>102944127
profound image
>>
I've met gay furfags who are really into AI and gay furfags who absolutely hate it. What's up with that?
>>
>>102944587
gooners love it, anyone who draws hates it. simple as
>>
>>102944587
autists are only capable of having extreme opinions
>>
>>102944032
>just prompting for them looking at each other and then the camera
So the solution would be to do this the other way.
>>
>>102944627
>anyone who draws hates it
Not true. There's a handful of artists (mostly /trad/ and /gud/) like me who are basically bemused or mildly entertained by AI. I like the process of drawing, AI doesn't take that away.
>>
>>102944025
What the fuck, I wanted information from a third party about if this was any good, I'm not clicking that link because it'd risk ruining the fetish.
>>
>>102944627
as someone who can't draw for shit and who's hand writing looks like parkinson's, AI is one of the greatest things to happen in my life time.
>>
>>102944704
That's cool bro, you're in the former camp then.
>>
her CyberBotoxâ„¢ subscription must have run out at the end


>>102944681
No the solution is to prompt for kissing because video models are always really good at kissing
If you wanna be the brave soul to prompt for tweens kissing attached to your Google or discord account be my guest
>>
>>102940941
I wanna squeeze that Reimu's belly.
>>
>>102944704
Pro tip: Everyone is like that at the beginning, and you automatically become better by just drawing.
Just make 10000 drawings and you'll be surprised to outdo your favorite artist.
>>
>>102944717
No, I said the video in reverse, which does not include kissing.
Also, I'm not going to try it if faces can deform like that, we're supposed to have these threads where people curate what they post and we don't have to deal with nightmare fuel.
>>
>>102944755
Ok but if you're interested in reversing the video you're obviously interested in the implication of them kissing so just gen them kissing
Sadly can't curate genmo since you only get 2 gens per account per 6 hours. It's novel enough right now to post whatever you can get that isn't completely borked
>>
File: hAACCKKtsune.png (2.12 MB, 1296x1728)
2.12 MB
2.12 MB PNG
>>
migu :(
>>
>>102944784
>you're obviously interested in the implication of them kissing so just gen them kissing
>Sadly can't curate genmo since you only get 2 gens per account per 6 hours. It's novel enough right now to post whatever you can g
Do you know how fetishes work? I can see a 20 minute video of a girl taking off her clothes and teasing that she's going to show he boobs and at the very end she unties he bra and the video ends and you don't see anything.
And here's another 20 minute video where another girl starts by showing off her boobs and squeeze them.
The second one is boring so I tune out, the magic is not knowing if she shows them or not, not knowing if they'll kiss.
If I know they will kiss, the ending is spoiled, so video generation does not work for me, i have to rely on not knowing what other anons prompted.
>>
>>
Has any non-fucked way to make an x/y plot appeared in shitstain ui, after FUCKING YEARS of waiting???????
>>
>>102941314
>>102944718
(You) are mentally ill
>>
>>102940941
https://github.com/bghira/SimpleTuner/commit/7e7c2f81
>sd3: add some sob stories about why it is hard to train
It's fucking over...
>>
>>102944587
The haters are the losers.
>>
>>102945164
I think all the gay furfags are the losers, but maybe that's just me
>>
I know this is unrealistic but my dream is to one day gen a 1girl so undeniably beautiful that she gets at least 6 (you)s
>>
File: grid-0849.jpg (746 KB, 2432x3328)
746 KB
746 KB JPG
>>
>>102945387
she's just like me
>>
File: grid-0850.jpg (847 KB, 2432x3328)
847 KB
847 KB JPG
>>
>>102945295
instead of generating a 1girl for the (You)s, generate something truly horrifying. something that makes people uneasy when looking at it that they have to respond.
>>
>>102945115
I don't know why people are such bitches. 1) every model is going to use different hyperparameters, 2) int8 training is inherently unstable 3) you really should be capping grad norm while training.

But also fuck SAI for not releasing their training code or at least giving pseudo code of the training process.
>>
File: grid-0840.jpg (1.48 MB, 2432x3328)
1.48 MB
1.48 MB JPG
>>
File: grid-0848.jpg (672 KB, 2432x3328)
672 KB
672 KB JPG
>>
>>102945414
My posts with the most Yous were text only. Just say something very outrageous, but that some people could believe, and people may keep referencing what you said months later.
>>
>>102945115
Yeah no I don't believe this. I mean I believe the guy had problems making it stable, but I doubt there's any fundamental problem.

SD3 is architecturally very similar to flux, but should be even more "normal" because it doesn't have distillation. Flux loras train very well and learn concepts quickly, and are stable across a wide range of hyperparameters. Something about the SD3 implementation in the train script is fucked up I would guess.
>>
>>102940941
>has been participating on the thread for more than 5 hours.
>just noticed I'm on the collage
I was this close to going to sleep and come back tomorrow top a new thread and never noticed I was featured.
>>
File: 1717570623982774.webm (830 KB, 720x720)
830 KB
830 KB WEBM
OLD MEME GUIDES:
https://files.catbox.moe/3az283.jpg
https://files.catbox.moe/e5mzsc.png
https://files.catbox.moe/5ix69v.png
https://dallery.gallery/the-dalle-2-prompt-book
>>
Who will get the first Real finetune? Flux of SD3M?
>>
File: gigu.png (2.51 MB, 1728x1296)
2.51 MB
2.51 MB PNG
>>
>>102945597
or*
>>
>>102945614
now this is art
>>
>>102945414
I can get (you)s the old fashioned way if I want, but it's very difficult to pull (you)s through 1girling alone. Getting six purely for the 1girl's beauty would be a triumph. I've had a gen go so viral that I saw it on twitter the day after (not AI twitter, regular twitter). That's fun, but I'd rather get 6 appreciative replies on a genuinely good 1girl.
>>
>>102945614
what the hell anon
>>
>>102945614
this man knows about comfy org
>>
>he gens for (you)s instead of for the love of the craft
Sad
>>
>>102945552
widebaker snubs us 1girlers every time. why even look? thankfully we have a good baker today
>>
>>102945614
holy kek
>>
>>102945614
LOOOL
>>
>>102945746
Being featured on the OP used to mean something, then I realized I could just create a new thread and feature myself if I wanted.
Anonymity means there's no difference.
>>
Holy shit, merging your chosen art style lora into the 24GB flux and then quantizing it improves speed AND quality. Maybe this is obvious to everyone but I didn't expect the quality to improve as well, I was just doing it for the speed.

flux1-dev-Q8_0.gguf + one lora: Prompt executed in 59.42 seconds
flux1-dev-mergedlora-Q8_0.gguf: Prompt executed in 44.51 seconds, and better quality

RTX 3090, 30 steps. When benchmarking these times I made sure to load the lora and model into memory first so that part's not interfering with the time.
It's a bit of a complex process so I might write a guide (if there isn't already one) to make the whole thing easier for people.
>>
>>102945597
whichever is the best model that is actually finetuneable on a 5090 at a pace faster than a snail
>>
>>102945680
I got more than a couple of (You)'s generating 1girls doing something disgusting. i stopped, because i don't want to be known as the resident degenerate.
>>102945721
I don't gen for (You)'s, but it is nice when at least 1 anon is entertained by something you did.
>>
>>102945614
So this is what takes to get 6 (Yous)?
>>
>>102945721
Nobody gens 1girl for (you)s. This is what genning for (you)s looks like: >>102945614

Genning 1girl with a small hope of getting (you)s is a different thing. Don't conflate the two. It's like saying that "I hope I can write such a compelling history of Chenla that 1000 people will read it" is "writing for fame"—not quite true
>>
>>102945840
And then you'll find about solipsism and realize all other people were part of your dream and that you really never helped anyone.
>>
>>102945818
I'm not sure if OP even gens desu
>>
>>102945897
>i don't want to be known as the resident degenerate
That's not how 4chan works, in 4chan you come back the next day as if nothing happened and nobody knows it was you who posted all that.
>>
>>102945818
>then I realized I could just create a new thread and feature myself
I've done this, but only when my gens really deserved it and the other baker at the time was an enemy who would surely snub.
>>
>>102945943
>This is what genning for (you)s looks like:
How do you know it wasn't the same anon reacting to his own image and that it didn't get a single (You)?
That's a bit of a problem and you never know.
>>
>Local Diffusion General
>>
>>102945970
I shat up /o/ so much with my meme images that people suspect me even when i don't attach an image to my post if i say something that even vaguely ties me to my images. its certainly possible on slower boards and generals.
>>
>>102945980
The last time I featured myself at /sdg/ everyone refused to use my thread, and they created a new one, and with two active threads, someone created a joke third thread, and the jannies nuked 6 entire generals to stop the spam of threads.
And I'll never get tired of telling that history, new threads were getting autonuked for a while, it was like a major event that happened just because people didn't like my generation being on the OP.
>>
>>102945984
could be, but it wouldn't change my contention that it was being posted for (you)s. Trying to get the ball rolling with some self-replies is classic (you)-seeking behavior
>>
>>102946019
Yeah, people nicknamed me Win7kun just because of the way I talked and even remembered on what OS I'm on.
>>
>>102946035
id feel proud of that. holy shit you must've made something so mundane it probably just felt insulting.
>>
>>102945943
Just a failed gen that I found funny
>>
File: 1722579911570925.jpg (130 KB, 1024x1024)
130 KB
130 KB JPG
>>102946012
Is there a local Udio or Suno yet?
>>
>>102946081
no offense meant, we all gen for (you)s sometimes
>>
File: cherries.png (401 KB, 512x512)
401 KB
401 KB PNG
>>102946064
It was this one, people deemed it so unacceptably ugly that they'd rather have everything nuked than having it as the OP for a thread.
Later we had farting Mr Incredible as the OP with 4 threads with different versions of him active at the same time and nobody cared, how times change.
>>
File: file.png (305 KB, 1248x992)
305 KB
305 KB PNG
>>102922743
>the real benchmark is realistic
looks like sana passes
>>
>>102946237
https://desuarchive.org/g/thread/95306221/#95307111
>anti-loli board
lol, what?
>>
File: ComfyUI_01527_.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>
File: ComfyUI_01528_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
>>102945103
Why??
>>
>>102946287
Ahh, that made me realize how much I miss the xkcd forums.
Life regrets.
>>
>>102946397
going through that reminds we why i use to ignore all the text and just post/look at pictures for ideas.
>>
>>102946274
if sana was a tranny on a tranny affirmation subreddit then yes
>>
File: file.png (316 KB, 1248x992)
316 KB
316 KB PNG
>>
>>102946362
Love it
>>
>>102945115

>Official training code was not released alongside SD3.5, leaving developers to guess how to implement the training loop based on the [SD3.5 repository contents](https://github.com/stabilityai/sd3.5) which leaves us with possibly subpar results.

The link he posted does not work, so I googled this https://github.com/Stability-AI/sd3.5

But, I thought there was already at least one example of training code on stabiity's own site, no?
https://stabilityai.notion.site/Stable-Diffusion-3-5-Large-Fine-tuning-Tutorial-11a61cdcd1968027a15bdbd7c40be8c6
>>
>>102947143
>But, I thought there was already at least one example of training code on stabiity's own site, no?
But that uses SimpleTuner...
>>
Hibernation mode
>>
>>102947191
Are you saying that everything in that page is fake and that its impossible to train the model?
>>
>>102941486
4x4090 for a 480p video?
damn why dream of coom@home will never come true
>>
File: ComfyUI_01546_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>
File: ComfyUI_01548_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
File: ComfyUI_01553_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
>>102946237
even totally nonsexual images of young girls are off-limits here, we are so pathologically certain of our own guilt that to take any kind of aesthetic pleasure in anything relating to the young is presumed irredeemably wicked.
>>
how many Windows vs Linux users are here?
>>
>>102947798
Arch user here running Forge on a chinese 3060 from aliexpress.
>>
>>102947523
ai-toolkit works fine
>>
>>102947798
https://strawpoll.com/kjn1DAQv7yQ
>>
>>102947798
>>102947980
what if we use both?
>>
File: flux_xl.jpg (1.52 MB, 3552x1776)
1.52 MB
1.52 MB JPG
seems I always have to use XL to get rid of the waxy flux look
>>
>>102947798
Linux has better support desu
>>
File: 00017-144219753-1.png (821 KB, 1024x1024)
821 KB
821 KB PNG
>>
>>102948003
prompt and settings, retard?
>>
>>102948019
Handsome lad. Shame the twisted democrats trooned him.
>>
>>102946090
Considering they probably used a few PB of properly tagged songs, who even has that.
>>
>>102948099
heres the issue, flux takes too long to generate which makes it harder to test different settings and prompts because i dont have all fucking day
>>
>>102948003
I would do this if using XL didn't fuck the details up with it's inferior VAE
>>
>>102948313
are there any specific vaes that could minimize the damage
>>
>>102948335
no because SD3s VAE is (I think) 16 channels and XLs is less
>>
>>102944111
Wow, for some reason, when I pressed that, it showed me a bunch of code, I tried it again, and now it shows the interface.
>>
>last gen was posted an hour ago
>>
chill anon
>>
https://huggingface.co/Freepik/flux.1-lite-8B-alpha

>We are thrilled to announce the alpha release of Flux.1 Lite, an 8B parameter transformer model distilled from the FLUX.1-dev model. This version uses 7 GB less RAM and runs 23% faster while maintaining the same precision (bfloat16) as the original model.

I wish they'd tell us whether this is compatible with all existing flux loras.
>>
>>102948557
>an 8B parameter transformer model distilled from the FLUX.1-dev model.
so they distilled a distilled model? you can't make this shit up man, wtf?
>>
>>102948567
Yep, they just created the immovable object.
>>
File: file.png (133 KB, 989x980)
133 KB
133 KB PNG
>>102948557
now that's interesting, why can't we use this knowledge to make better quants instead? a GGUF that only focus on the layers that shit their pants on would be a good idea
>>
>>102948557
It shouldn't be because it has different parameter count.
>>
>>102948557
Huh? This has nothing to do with BFL right?
>>
so uh... you gunna dedistill this one too or
>>
>>102948644
not at all, it's just some random who want to make Flux as the same size as SD3.5 I guess
>>
File: file.jpg (1.1 MB, 3600x1800)
1.1 MB
1.1 MB JPG
>>102948557
meh, looks like a bf16 vs fp8 version, why would we go for a 8b model if we can simply go for fp8 instead, because fp8 12b is way much lighter than bf16 8b
>>
>>102948680
Because you can train the 8B model easier
>>
>>102948693
>Because you can train the 8B model easier
it's still a big ass motherfucker, and it's still distilled so... at least on the 12b version we got de-distill
>>
>>102948723
It's easier to do the de-distill, and your de-distill 12B still needs someone to train it because I don't know if you noticed, but the base model is actually ass outside of Trump memes.
>>
>>102948557
When will you give up on Flux? It's not happening, anon.
>>
>>102948737
>12B still needs someone to train it because I don't know if you noticed, but the base model is actually ass outside of Trump memes.
I totally agree with you on that anon, if Flux dev has more concepts it would be the goat (local and API included)
>>
File: file.png (17 KB, 194x259)
17 KB
17 KB PNG
>>102948747
what's the alternative? sd3.5?
>>
>>102948760
Yeah which means someone ultimately has to train it for 200,000+ steps. 8B will essentially be able to do that 40% faster.
>>
>>102948769
Shut up frogposter
>>
>>102948747
Its becoming a meme at this point.
>>
>>102948519
desu im feeling uninspired or perhaps simply tuckered out
>>
File: file.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
File: file.png (70 KB, 1125x402)
70 KB
70 KB PNG
>>102943969
>I hope he didn't hardcode the model name -_-
I have the same issue as you anon, I downloaded the bf16 but it's not recognizing it at all
>>
sorry, once my /pol/ ban expires i will take this trash elsewhere
>>
why can any random realistic pony merge do anatomically correct penises and pussies but flux, regardless of the loras used, still generates a mangled mess? Will it ever be able to do this?
>>
File: ThisTook10minutes.webm (623 KB, 856x480)
623 KB
623 KB WEBM
>>
>>102943969
>>102948860
ok it works now that I updated the node, now there's an alternative to flash_att which is sage but it needs triton and when I run it I got compilation errors so...
https://github.com/kijai/ComfyUI-MochiWrapper
>https://github.com/kijai/ComfyUI-MochiWrapper

>>102948919
yeah the quality is much worse than what's in the API demo, maybe that's because we're not using the HD version, did you go for fp8 though? I'll be trying bf16 and see if that improves stuff
>>
>>102948906
Because base Pony was trained on penises and pussies and Flux was not? A core concept like that takes hundreds of thousands of steps.
>>
>>102948906
pony is based, flux is not simpleas
>>
>>102948934
>A core concept like that takes hundreds of thousands of steps.
so it's not happening any time soon?
>>
>>102948906
I think the realistic pony is in a way an "artstyle" swap from anime to realism. So it has the intelligence of the hentai models behind it.
That base intelligence was paid for in tons of money and training by the Pony guy (and luck apparently?).

As for Flux I think because it's a much larger model, it costs a lot more money to rewrite it. Keep in mind, SDXL was overwritten by booru training, so it actually lost a lot of its original SDXL stuff. To do so with Flux would cost a lot.

Also the distillation makes it extra hard to train.
>>
File: file.png (49 KB, 1232x759)
49 KB
49 KB PNG
>>102948932
>I'll be trying bf16 and see if that improves stuff
Unfortunately you can't go further than 61 frames, after that you'll overflow a 24gb card
>>
>>102948948
I'm sure someone based will do with their 5090 when it comes out. But you're still talking like 10,000,000 seconds of training time which is like 100 days.
>>
File: file.png (2.69 MB, 2249x1392)
2.69 MB
2.69 MB PNG
For those interested, Omnigen is released
https://xcancel.com/cocktailpeanut/status/1849201053440327913
>>
File: file.png (1.59 MB, 1344x768)
1.59 MB
1.59 MB PNG
>shot from below, family looking down the camera and smiling, father on the right, mother on the left, boy and girl in the middle, happy family
kek, this was made with SD3.5, can Flux do it?
>>
>>102948932
fp8 yeah. Really not impressed so far, maybe my settings are bad but it barely feels like an upgrade over Cog
>>
File: file.png (547 KB, 1024x1024)
547 KB
547 KB PNG
>>102949000
hm
>>
>>102949023
why is the model so shit at upside down people
>>
>>102949023
i really feel like this "on the left" "to the right of" shit is a dead end. even the biggest LLMs can fuck up "where is suzy?" word problems and we expect these shitty text encoders to get it right. there has to be a better way of encoding positional/relation information in the scene than paragraphs of text.
>>
>>102949058
because they have a shit dataset
>>
File: tfw.png (387 KB, 608x610)
387 KB
387 KB PNG
>>102949023
>>
>>
>>102949034
yeah the quality image is shit but I like their idea, you can put any image of any celebrities or anime characters as an input, and it just works

>>102949058
not enough pictures with upside down people, and it doesn't help they train their model on """"ethical"""" pictures, they are retarded and China will destroy everything by not giving a fuck about the copyright nonsense
>>
File: file.png (148 KB, 2333x753)
148 KB
148 KB PNG
>>102948967
>Unfortunately you can't go further than 61 frames, after that you'll overflow a 24gb card
Ok that's interesting, I did a second run and now I can go up to 80+ frames without overflowing the card
>>
>>102949089
>""""ethical"""" pictures
Pretty sure there are plenty upside down porn images indeed.
Though I always wondered if this was stripped from the dataset or just non captioned by the vlm.
DALLE had no problem with that kind of shot if I remember well, but then again DALLE had plenty porn, the filtering was post hoc.
Which is smarter than whatever pre-filtering nonsense all the other companies seem to do, because despite being smaller it gets anatomy so much better.
>>
>>102949028
>fp8 yeah. Really not impressed so far,
I just did a bf16 run and you don't get the ghost morphing glitch anymore, the images are in way better quality, fp8 is a fucking meme man, I hope they'll make a Q8_0 version
>>
>>102949141
>fp8 is a fucking meme man,
at this point, I'm wondering why can't we have something in the middle, fp12 would be a sweet spot for Flux and Mochi
>>
>>102949131
How long does it take you per 80 frame video? 80 frames (just over 3 seconds) is workable for local I guess
>>
>>102949148
2 4 8 16 32 64 128
Notice something?
>>
File: file.png (14 KB, 1586x67)
14 KB
14 KB PNG
>>102949149
>How long does it take you per 80 frame video?
For 85 frames + 65 steps -> 25 minutes
>(just over 3 seconds)
I still don't know what's the fps of Mochi, is it 24fps?
>>
>>
>>102949089
>China will destroy everything by not giving a fuck about the copyright nonsense
it's even worse when you think about it, the current meta in the west is that you need to ask for consent each holder of a public image or art so that a model can look at it, aka complete insanity
>>
Fresh

>>102949176
>>102949176
>>102949176
>>
>>102949061
>there has to be a better way of encoding positional/relation information in the scene than paragraphs of text
Omnigen is the answer. You can compose images with images and modify them not only with text but with images as well.
>>
>>102949155
>What is exllama2?
https://github.com/turboderp/exllamav2
this software can go for any bits you want and it's still fast as fuck, so no anon, I won't accept that 2^N bit excuse
>>
>>102949167
>complete insanity
it is insanity, there's currently no laws that prevent you to train your model with whatever you want, those retards are shooting themselves in the foot
>>
>>102949182
I'm excited to see your code.
>>
File: file.png (1.51 MB, 1056x594)
1.51 MB
1.51 MB PNG
>>102949196
>>
Prompt adherence om data close to what it's seen (i.e generating variations) seems laking for genmo, or maybe not idk it's hard to test with only 2 prompts every 6 hours

>>102949162
>25 minutes
Fuck. Hopefully it's at least under 20 with the 5090, but since it'll have 32gb VRAM it'll be able to do longer videos and so it'll still take 25 minutes probably kek
>>
>>102949456
>Fuck. Hopefully it's at least under 20 with the 5090
it will be way faster for sure, I'm only using a 3090
>>
>>102948967
vram is a scam. The inference engines we have are just bad at memory management. NO consumer gpu is starved for ram. They might be starved for cache though.
>>
>>102949456
I like the look, but it's somehow very horrible at the same time, like toxic somehow. I feel like it's a kind of vr world where something horrible will happen, then the protagonist will wake up in shock from the dream.
>>
>>102949794
>toxic somehow
I'm kinda going for the "everything looks normal but something feels very wrong" for my dystopian cyberpunk art project so if you feel that way in your gut from seeing genmo gens it's actually a good thing for my use case



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.