[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1733946694528578.mp4 (747 KB, 1088x960)
747 KB
747 KB MP4
Discussion of Free and Open-Source Diffusion models.

Last bread : >>103482892

>Local (Hunyuan) Video
Windows: https://rentry.org/crhcqq54

>UI
Metastable: https://metastable.studio
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI

>Models, LoRAs, & Upscalers
https://civitai.com
https://tensor.art/
https://openmodeldb.info

>Cooking
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
Forge Guide: https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1050
ComfyUI Guide: https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Guides & Tools
Share the Sauce: https://catbox.moe
Perishable Sauce: https://litterbox.catbox.moe/
Generate Prompt from Image: https://huggingface.co/spaces/fancyfeast/joy-caption-alpha-two
Artifact resources: https://rentry.org/sdg-link
Samplers: https://stable-diffusion-art.com/samplers/
Open-Source Digital Art Software: https://krita.org/en/
Txt2Img Plugin: https://kritaaidiffusion.com/
Collagebaker: https://www.befunky.com/create/collage/
Video Collagebaker: https://kdenlive.org/en/

>Neighbo(u)rs
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai

>Texting Neighbo(u)r
>>>/g/lmg
>>
Is there a way to pause a generation or the next in queue in comfy?
>>
https://github.com/kijai/ComfyUI-HunyuanVideoWrapper/issues/119
>I uncommented some of the code for CFG and negative prompts and have been playing around with it a bit. This model works just fine with CFG, as long as embedded_guidance_scale is 1, or a low value. Both in general use, and especially for loras, it can improve prompt adherence. I think there's no reason to have CFG disabled. The model works like Flux dev, you can use CFG as long as you have the settings right.
I mean why not but it's 2x slower with CFG, I don't want that lol
>>
File: HunyuanVideo_00017.webm (379 KB, 352x640)
379 KB
379 KB WEBM
My 2-second videos are smooth, but my 5-second videos are choppy
Is it like this for everyone, or does this indicate that I fucked up my settings?
>>
>>103487150
did you go for 24 fps? because that's the official frame rate of hunyuan
>>
File: 1729415745465844.jpg (105 KB, 984x984)
105 KB
105 KB JPG
>>103487099
>>103487168


Ok
>>
File: HunyuanVideo_00036.webm (120 KB, 352x640)
120 KB
120 KB WEBM
>>103487171
Yeah, all of my stuff so far has been 24 fps, but some have been way smoother than others
>>
>>103487099
can hunyuan do img2vid?
>>
>>103487200
are you able to make her any younger perhaps 18 or 19
>>
>>103487232
two more weeks
>>
File: 1710375415823027.jpg (120 KB, 984x984)
120 KB
120 KB JPG
>>103487242
For what purpose? IRL she's 23. Isn't that young enough?
>>
>>103487099
name?
>>
Blessed thread of frenship
>>
>>103487248
Source your ass?
>>
>
>>
>>103487440
Keep truthin' bro.
>>
>>103487452
Can this run on a local machine now?
>>
>>103487459
yeah you just need a 4090 for an ok time or a 3090 for a bad time or a 3060 12gb for a time
>>
>>103487459
This is the local bread after all, anon
>>
>>103487473
>>103487474
Suppose I want to make a 5-second video. How long would that take to make assuming I have a 4090?
>>
File: ComfyUI_122715_.png (2.68 MB, 1248x1824)
2.68 MB
2.68 MB PNG
Euler Ancestral Beta with SD 3.5 Medium is pretty cool
>>
>>103487473
What about a 4080?
>>
>>103487515
nothing's more important than vram anon
>>
>>103487452
wasnt mochi1 that promised i2v?
>>
>>103487515
for hunyuanvideo? no, I don't think you want to do that. at the not so fantastic native resolutions it takes minutes to get two seconds of video on a 4090.

if you want to run ltx (completely different model, weaker), that's a different story.
>>
>>103487595
maybe? for hunyuan the imagetovideo checklist is right on the github page
>>
>>103487595
yeah, mochi promissed mochiHD and mochi i2v but at this point I don't give a fuck about that anymore
>>
>>103487515
Can use fp8_fast and torch compile so it's a lot faster than the 3090 but it's much more restricted on frame count/resolution. It's an interesting trade off but the 4090 is undoubtedly king
>>
Can it do porn? Or just videos of girls standing by the beach? Anime?
>>
>>103487633
>Can it do porn?
yes >>>/aco/8639708
>>
>>103487645
Hell yeah, hope it can do kemono, finally a reason to dump $3000 on a graphics card.
>>
>>103487653
>hope it can do kemono
it can, can do furry aswell, this model is amazing
>>
>>103487521
>>103487452
hmmm bro how old is she?
>>
File: 20241130_030245.jpg (58 KB, 803x767)
58 KB
58 KB JPG
>>103487200
>not nxttakatta
>>
>>103487452
>the github roadmap said
github roadmaps say lots of things... do they ever happen? idk....
>>
>>103487675
She's not real anon
>>
>>103487633
>Can it do porn
Yes, definitly. But it's certainly not a pony/illustrious for video in terms of what porn it actually knows. At least not with the current text encoder.
>>
>>103487989
Eh, still probably worth grabbing a 5090 for what will be in the pipeline
>>
is it only in this thread or where else can a coomer such as my friend find these
>>
>>103488003
Read the OP
>>
>>103487998
Oh sure. It's not like this global lewding project is going to stop at x specific model or y lora anyhow, and hunyuan IS a good video model AND uncensored which is actually also just far less infuriating anyhow even if you want SFW humans
>>
Hunyuan quant that I can run with 8gb vram and 64gb ram when?
>>
File: 1711252009215114.png (1.86 MB, 884x884)
1.86 MB
1.86 MB PNG
>>103487834
>nxttakatta
Who?
>>
>>103488041
you can run it with 8gb, if you're happy with 320x240 and no more than 25 frames
>>
>>103488065
Better than nothing I suppose.
Guess I'll try it thanks anon.
>>
>>103488041
>>103484235
>>103480445
It takes time but able to be done at bigger resolutions and longer frames.
>>
>>103488113
Owner of the hyvid wrapper repo poopooing gguf quants for no reason is annoying. We can torch compile on a 3090 with q8 gguf
>>
File: themartialarts.webm (518 KB, 288x480)
518 KB
518 KB WEBM
>>
>>103488149
kung pow.

I wonder if you could take this vid, upscale the resolution, vid to vid it and get something that wasn't shit.
>>
File: HunhuyanVideo_00013.mp4 (903 KB, 720x720)
903 KB
903 KB MP4
>>
>>103488174
migu :(
>>
File: themartialwat.webm (1.16 MB, 960x544)
1.16 MB
1.16 MB WEBM
>>103488163
Not sure you can very closely reference the video but generate higher resolution just yet. But also it feels like attempting higher quality anything is better left for later. When we have the text encoder and all that, perhaps some tools will be more performant too.
>>
File: themartialarts2.webm (482 KB, 960x544)
482 KB
482 KB WEBM
>>
>>103488174
>tfw your sex doll pops a leak and all the jizz splashes out
>>
>>103488196
Ashi Bunshin no Jutsu
>>
How do we force chang to hand over mllm weights?
>>
File: out.webm (218 KB, 640x704)
218 KB
218 KB WEBM
>tfw your sex doll comes to life
>>
>>103487099
Who's the girl on the left?
>>
>>103487099
>>103486990
>>103486549
>>103484226
I'm on a 3090. Can anyone else that got 1280x720p at 129 frames using a 3090/4090 post a screencap of their comfyui setup? (I have 128gb ram, and 4 3090s actually btw).
>>
>>103488312
A non-existant AI girl
>>
File: 00046-4222309965.png (2.15 MB, 1024x1280)
2.15 MB
2.15 MB PNG
>>
>>103487452
prompt brother?
>>
>>103488350
That's fucking cool. Tips?
>>
File: 00047-1951914986.png (2.02 MB, 1024x1280)
2.02 MB
2.02 MB PNG
>>103488360
prompt was
1girl, striped shirt, portrait, sketch, abstract

<lora:koffoid:1>
<lora:add_detail:1.6> <lora:detail_slider_v4:2.7>
<lora:LCM_LoRA_Weights_SD15:1>
lcm sampler, 1 cfg
>>
>>103488346
But I want to marry her
>>
File: trainedbysouthkorea.webm (516 KB, 640x368)
516 KB
516 KB WEBM
>>103488229
A true beginner-level move.
>>
How do I achieve sora level txt2vid gen locally?
What's the best I can have?
>>
>>103488418
stable video diffusion
>>
>>103488418
>sora
>best
>>
File: HunyuanVideo_00048.mp4 (579 KB, 544x960)
579 KB
579 KB MP4
>>103488403
I'm sorry to hear that, anon
Here's a longer version
>>
>>103488418
>sora level
nobody tell 'em
>>
>>103488437
Thank you mate. AI has been a blessing for my dick so far.
>>
>>103488422
it's shit
>>
>>103488382
1.5??? Wow.
>>
File: movebehindandsmack.webm (373 KB, 640x368)
373 KB
373 KB WEBM
>>103488418
technically you can have sora-level amounts of hgx gpus at home and actually there are some fat video models for that, but the cost is a bit high...

maybe do what us low level peasants with typically one and at most a handful of 3090/4090s do
>>
>tfw I have to open up crusty old incel wsl2 to fuck around with what's going to be shitty LoRA making for Hyvid

Why cant these people just learn to use windows?
>>
>>103488469
what models
>>
>>103488519
Most of the machine learning research have been done on Linux because Python is a fucking mess on Windows and most of the high performance code libraries aren't on there and come from HPC backgrounds where Linux rules supreme. The question is why shouldn't people be learning Linux instead?
>>
linux wipes my grundle with its tongue
>>
>>103488531
hunyuan video. also ltx mochi cogvideox pyramidflow, depending
>>
>>103487452
>>103487521
wait a second...
>>
Gguf me up already, just inject those quants directly into my veins. I NEED it
>>
>>103488693
rf inversion is more important, whatever the fuck that is
>>
>>103488149
Insane sovl
>>
I'm guessing I'm better off setting my 2060 12gb on fire and breathing in the fumes to see visions than try to run the generations.
>>
>>103488670
how much vram do I need for hunyuan video
>>
>>103488751
no i'm running a 1080 TI, we're kindred spirits here, at least you have extra vram leeway.
unless you mean vidgen then i have no clue how you even considered that train of thought at all.
>>
>>103488741
agreed. it's probably not very hard to make a lot more that looks like vintage kung fu movies too.

>>103488751
if you're patient and accept lower resolution, I suppose you can try hunyuan?

ltx for example should be quite easy tho. and of course you can imagegen with a lot of models.
>>
>>103488817
>gen low res vid using hunyuan
>upscale using img2img
will this be consistent?
>>
>>103488780
i think >>103488065 ? but it's clearly much better to have 24GB or even the really expensive nvidia cards given the resolution&clip length vs time to generate

the actual authors recommend 80GB VRAM
>>
>>103487200
d-did psycheswings get a reduction?
>>
File: 1732533678164381.png (283 KB, 476x362)
283 KB
283 KB PNG
I need to know how hunyuan handles pregnancy, for scientific reasons
>>
File: k.webm (516 KB, 272x480)
516 KB
516 KB WEBM
>>103488897
Using a video model like LTX might work.

Doing it with flux/sdxl/whatever frame by frame? I doubt that this makes any sense unless you want, like, a ton of stuff popping up and disappearing one frame each.
>>
>>103488919
that's with their yet unreleased text encoder and the model both in memory together, producing the "minimum" of 129 frames @ 960x544
you can get away with a lot, lot less
>>
>>103488215
"Hey look who came out of their cave, why don't you show Mrs. Johnson some of those karate moves you've been practicing?"
>>
https://github.com/kijai/ComfyUI-HunyuanVideoWrapper/
can anon confirm this works on linux?
>>
>it can't do Taylor Swift
please tell me this is just a skill issue and not another KEKED MODEL
>>
>>103489090
https://github.com/C0untFloyd/roop-unleashed
>>
>>103489090
that name means nothing in China
>>
File: NIGGERS.mp4 (285 KB, 352x352)
285 KB
285 KB MP4
>>
>>103489090
best we can do is billie eillish bouncing on a ball with her fat ghetto ass jiggling out
>>
>>103489049
I will try it tonight or tomorrow. I want to fogure out what the best current settings are though for 24gb vram.
>>
>>103489113
>NIGGER bird has gained video sentience
BASED
>>
>fat milk whore OP
>nigger bird in motion
>pregnant karate
Blessed thread
>>
File: pregnant.webm (634 KB, 272x480)
634 KB
634 KB WEBM
>>103488973
it does
>>
>>103489173
Now someone just needs to use the new lora training capability to train a Cerfukin lora.
>>
>>103489178
>final day of no shit september
>>
>>103489194
I will personally rip the teeth out of that turkish faggots mouth, no need for a lora
>>
File: bogsylvania.mp4 (339 KB, 720x480)
339 KB
339 KB MP4
the lora we really need is a BOG lora so we can do more like vidrel but with far more motion
>>
File: controlissue.webm (451 KB, 272x480)
451 KB
451 KB WEBM
>>
literal cunny machine
>>
>>103489318
why do half the girls you post look like they have downs
>>
File: PA_0012.jpg (868 KB, 1664x2432)
868 KB
868 KB JPG
>>
can hunyuan generate cctv style footage like this?
https://youtu.be/Hd6Z_pGx9_4
>>
You pedo pervs are going to get this stuff banned and shut down. When you need to get a license to buy graphics cards remember your brazen acts.
>>
File: PA_0021.jpg (1.07 MB, 1664x2432)
1.07 MB
1.07 MB JPG
>>
>>103489217
Is this even legal in the United States?
>>
>>103489388

I just hope it all goes to shit only after I can make sex scenes with my hot coworker in it, then they can take my goods.
>>
pixartsexuals i feel a rumblin
>>
File: PA_0023.jpg (802 KB, 2560x1536)
802 KB
802 KB JPG
>>
I shoulda saved anon's hunvid catboxes. Is there a current meta workflow?
>>
>>103489388
They tried to ban books once, you know
>>
Remember when Catpiss got a loveletter from their ISP.
>>
File: out.webm (130 KB, 960x544)
130 KB
130 KB WEBM
hags
>>
all the people posting pedo gens are most likely going to get V&
>>
>>103488964
Nope. I don't think she ever will.

https://www.tiktok.com/t/ZTYH5vSXW/
>>
File: ayylien toes.mp4 (156 KB, 960x544)
156 KB
156 KB MP4
>>
Ai booru's pictures aren't loading for me but judging by the comments I appear to be the only one affected by this . Any help?
>>
>>103489497
its just the one dude that already got shit on earlier but somehow had a big defense force
before i had little opinion on the matter but now i'm 100% convinced its just an OP to start getting regulations going and these threads sanitized
>>
>>103489217
Did you prompt for slow-motion? Can it do that?

Or timelapse?
>>
man this is one clip i'd love to vid2vid with a proper art style
its so hot
https://gelbooru.com/index.php?page=post&s=view&id=8221948&tags=tinker_bell_%28disney%29+
>>
>>103489534
Even if it is just one faggot posting it, he's still going to get fucked because mods and jannies have contact with governmental agencies and report CSAM to them, real or ai generated.
I totally agree though, it seems like there's groups intentionally spreading it around to get the government to regulate AI gens to the elite only.
>>
>>103489234
I'm currently testing a Furkan LoRA on Hyvid. I'll do this one if it turns out promising.
>>
Trying my hardest to not reply to lowiq b8
>>
File: Untitled.png (31 KB, 1885x177)
31 KB
31 KB PNG
The latents are caching
>>
>>103489595
the splines reticulating
>>
>>103489595
>>103489601
rivulets popping
>>
File: NIGGER NIGGER.mp4 (658 KB, 960x544)
658 KB
658 KB MP4
>>
eugh
>>
is 960x544 the highest resolution possible for consumer gpus or can you go higher?
>>
>>103489639
You can go higher with block swapping. Even 24gb can generate like a second at 720p
>>
>>103489639
can go nuts numbers with just 1 frame
>>
File: HunyuanVideo_00617.mp4 (307 KB, 256x448)
307 KB
307 KB MP4
Ok I was wrong, it can do Taylor Swift. But you have to add things like "on the Eras tour", "a blonde singer with lipstick and a fringe", etc

even then it sometimes gives her kpop face
>>
File: Untitled.png (118 KB, 1582x402)
118 KB
118 KB PNG
I think it's working.
>>
>>103489665
Delete this.
>>
>>103489665
you know what to do now
>>
>>103489665
Now do a collab tour with David Bowie (90's)
>>
File: HunyuanVideo_00621.mp4 (170 KB, 256x448)
170 KB
170 KB MP4
Weird that my gens are getting cooked bad at 3.0 guidance. Something about sexy gens of Taylor Swift is not conducive to good results. Maybe this is that dataset poisoning people were on about
>>
>>103489708
NTA but this will just get you a mashup of bowie and swift
>>
>>103489384
so.. not possible?
>>
>>103489741
it doesn't do two separate subjects?
>>
What should I use to try to upscale old pixel art?
>>
>>103489751
nearest-neighbor
>>
>>103489745
try it yourself. Nobody is wasting their GPUs valuable compute time on your passing curiosity.
>>
File: PA_0032.jpg (509 KB, 3328x1152)
509 KB
509 KB JPG
>>
>>103489049
Status report on linux: I got a black output but I'm sure it's because of a settings issue. Otherwise I installed via comfy ui manager + using conda venv. I had a few errors fixed by just installing whatever packages it complained about through pip (no need for sageattention 2 it seems). The only linux specific issue is the default filepath for the vae uses windows path hyvid\ where linux would use hyvid/. But just delete the "hyvid\" part and put the link to your vae directly i.e. dont put it in a subfolder. I'll report when I fix the black screen issue.
>>
>>103489765
ok, no gen gpulet
>>
>>103489795

I'm training a LoRA for hyvid right now. Can't deal with your shit.
>>
>>103489730
I mean your micro tiny resolution is probably having a bigger impact
>>
>>103489784
https://github.com/kijai/ComfyUI-HunyuanVideoWrapper/issues/99
maybe this is related?
>>
are people gonna start judging each other in these threads by what resolution they're able to gen at
"look at this faggot and his tiny microdick resolution"

would be funny if so
>>
>>103487099
>>103488346
with what model and prompt was the girl on the left created?
>>
>>103489876
They'll try. I don't think it will catch on.
>>
>>103489876
mini ani had everyone cheering for him despite staying with 384x384
>>
>>103489876
Resolution and aspect ratio has a huge impact on the output
But regardless vramlets should be shamed at every possible opportunity
>>
>>103489474
>>103489894
go back
>>
File: HunyuanVideo_00647.mp4 (93 KB, 256x448)
93 KB
93 KB MP4
>>103489820
The resolution doesn't tend to overcook by default. Here's a more typical sort of result from a normal prompt, also 3.0 guidance.

Taylor Swift gens cook more. I don't know why.
>>
Damnit I did it again
>>
>>103489974
you poopied your butthole again?
>>
>He pulled
>>
>He poopied
>>
File: HunyuanVideo_00126.mp4 (339 KB, 960x544)
339 KB
339 KB MP4
looks like we got some comedians in the thread tonight
>>
File: HunyuanVideo_00001.mp4 (654 KB, 2048x768)
654 KB
654 KB MP4
>>103489572
Still haven't figured out the best settings for rf-inversion for this kind of thing
>>
What the fuck is RF inversion? I fell asleep and now it's the hot net shit.
>>
File: HunyuanVideo_00127.mp4 (298 KB, 960x544)
298 KB
298 KB MP4
>>
>>103490105
It reconstructs latents from an existing source so you can operate on the embeddings rather than on pixels. So you have a closer ground truth to start with.
>>
Unironically make sure to back up and save all models and source, even for much older versions. Pedo's gonna bring the law down on this shit and they'll be forced to remove and heavily censor the models,
>>
File: HunyuanVideo_00129.mp4 (915 KB, 1200x680)
915 KB
915 KB MP4
>>
>>103490086
He should be fat enough to touch type
>>
vram chads i will join you at some point in valhalla
>>
>>103490136
kek I used trolling and a rickroll appeared, this model is pretty smart wtf
>>
>>103490134
Pedo was doing this with flux and nothing happening. Fuck him though.
>>
>>103490086
wow he's literally me

>>103490101
even though it's not even following the original clip whatsoever, i'm impressed hunyuan has enough training on tink to pull off the disney 3d style so perfectly.
>>
>>103490162
>i'm impressed China doesnt give a shit about copyright
>>
>>103490168
I, for one, am
>>
>>103490168
dont be a smartass or ill be forced to joke about your poopy butthole
>>
File: Untitled.png (105 KB, 1293x794)
105 KB
105 KB PNG
What do you think he trained?
>>
>>103490183
primary school crush
>>
File: HunyuanVideo_00099.webm (1.27 MB, 960x544)
1.27 MB
1.27 MB WEBM
>her shiny skin is oiled and glistening
it's like MSG but for your prompt (i shouldve remembered this from imagegen). big ups to elf ASMR anon
>>
>>103490168
it's all fair use, nigga
>>
>>103490183
Copyright Protected ("CP") content
>>
Why are videos seen as more dangerous than images. It's all just pixels, man.
>>
File: HunyuanVideo_00132.mp4 (374 KB, 960x544)
374 KB
374 KB MP4
>>103490183
>greyscale lora

hey look everybody, i trained a black and white lora, the trigger token is "black and white video", it works 100% , trust me
>>
>>103490221
Images imply, videos tell.
>>
File: HunyuanVideo_00096.webm (675 KB, 960x544)
675 KB
675 KB WEBM
la creatura

>>103490221
think of the pixels anon!
>>
>>103490236
We both know that's bullshit.
>>
Can we go back to posting sweaty Korean bitches instead of this shit
>>
File: HunyuanVideo_00003.mp4 (58 KB, 512x512)
58 KB
58 KB MP4
>>103489855
This solved my issue. I had to run pip install torch==2.5.1.
I want to confirm this is the expected output? Also, changing attention_mode to sageattn vs sdpa gives me a speed up from 128.81 seconds to 89.9 seconds. Reserved memory went from 8.8GB to 8GB. This is with the "hyvideo_lowvram_blockswap_test" example
>>
File: HunyuanVideo_00135.mp4 (564 KB, 1200x680)
564 KB
564 KB MP4
>>
File: 00120-568261850.png (2.65 MB, 1080x1576)
2.65 MB
2.65 MB PNG
man i shouldn't be hungry at 9PM
>>
>>103490236
what does that mean, on any level
>>
>>103490183
It's fairly normal stuff, just not something I'm gonna link to my normie github I put on my resume and shit.
>>
>>103490107
figuratively me
>>
>>103490314
why not moving
>>
>>103490335
Oh you're here? I'm training a LoRA now. Hope you didn't bog my computer.
>>
File: HunyuanVideo_00056.mp4 (1.96 MB, 544x960)
1.96 MB
1.96 MB MP4
>>103489730
4.0 guidance
>>
File: bogdanoff meme1.jpg (20 KB, 400x400)
20 KB
20 KB JPG
>>103490347
>he's training?
>pull it
>>
>>103490351
now make her oiled and glistening
>>
So how are we feeling about Hyvid? Are we dooming or blooming?
>>
>>103490351
now give her down syndrome
>>
File: HunyuanVideo_00137.mp4 (514 KB, 1200x680)
514 KB
514 KB MP4
>>103490351
oh no, how long until this model gets banned? You know they will start banning ai models like they do with books and movies
>>
>>103490347
I'm always lurking.
>Hope you didn't bog my computer
You are right to be skeptical, I always am of open source projects that are brand new. There is actually a surprisingly low amount of lines of code considering everything the training script supports. You could scan through everything in probably like 20 minutes and confirm it's not doing anything sus.
>>
>>103490370
>You could scan through everything in probably like 20 minutes and confirm it's not doing anything sus.
You know I won't.
>>
>>103490351
seconding >>103490357 and >>103490364 and also make her ten years younger
>>
>>103490369
you should see that billie eillish vid someone genned on aco, its nuts. my coomer brain desires a completely nude version. >>103487645
>its one of the first posts in that thread i dunno how to crossboard link
>>
>>103490357
>>103490394
I have no further interest in making celebslop. I was just testing that anons prompt and proving it has nothing to do with guidance scale
>>
>computer, show me a young, oiled, glistening, retarded, taylor switft
>>
I don't see the appeal of celebrities myself, show me an oiled up megumin cosplay letting off an explosion.
>>
>>103490404
>MY POV AS I'M GLIDING IN FOR THE SNIIIIIIIIIFFFFF
>>
Guys I'm browsing from work...
>>
File: HunyuanVideo_000212.mp4 (754 KB, 1088x960)
754 KB
754 KB MP4
>>103490370
Have you tried rf-inversion? since you can code, can you add an option to edit the sigmas in the inverse sampler and resampler? using multiply_sigmas you can maintain consistency from the original source, I've tried it with rf-inversion in flux and works very well
>>
>>103490306
What's the difference? All women look the same.
>>
>>103490422
Right has some dry ass lips
>>
>>103490404
This is SFW...
>>
>trani gets btfo'ed by normal anons episode
Always the best
>>
File: HunyuanVideo_00142.mp4 (373 KB, 960x544)
373 KB
373 KB MP4
>>
File: HunyuanVideo_00144.mp4 (655 KB, 1200x680)
655 KB
655 KB MP4
>>
>>103490451
As always the Jannies do fucking nothing about ban evasion. It's a miracle people aren't just spamming CSAM all day
>>
>>103490422
>can you
I probably could but I won't. Never used rf-inversion, I'm just not interested in any vid2vid type stuff, sorry.
>>
Completely, 100% serious. How long until AI gets censored or regulated? I mean just look at this thread, it's going to happen eventually. I give it about 4-5 years
>>
>>103490483
>>103490508
dont stop experimenting anon. when you find the god prompt share it with us
>>
>>103490532
>Let's ban math!
But unfortunately the pedo is making a good case for it.
>>
>>103490532
Ultimately it's about outputs. It's no different than if he was Uncle Anon posting in 2019 with pedo bait pictures from Facebook. The problem is the jannies are fucking retarded and this site has done almost nothing to prevent the completely circumvention of filters and ban evasion.
>>
>>103490461
Not nearly retarded enough looking
>>
>Sora the slopped
>"revealed"
>inconsistency of 2D gen again
again, 3D gen is the future.
>>
>>103490532
in 4-5 years you'll be able to generate a 45 minute porno of a random girl from her highschool yearbook photo so regulations will not affect anything
also remember regulations = china wins
>>
>>103490550
Fortunately for everyone that won't happen because you'll need to be mildly affluent to afford the $10k in hardware.
>>
>pedoanon weirdo posts
>someone starts concern posting about ai getting banned
This routine is getting tiresome my glowing friend
>>
>>103490550
I don't think Trump will do anything. But I still think 2025 will be the year some event or something happens with AI that brings it under scrutiny to be regulated. If anything happened, they would probably go after open source and online projects, taking down old models and forcing compliance with a new model that has filters and prevents gens
>>
>>103490555
lol I remember arguing with people on this site only a few MONTHS ago saying we will be able to have local video sometime soon, and they were saying it wont happen for a very long time due to this and that lol

So I wouldn't be so sure.
>>
>>103490568
>3 second videos
>requires a $1800 GPU to do it
>>
>>103490561
Glow boy could easily get a raise by targeting trani but he won't
>>
I have yet to see a good single frame Hunyuan gen. They all look worse than what image models can already output.
>>
>>103490573
Using 3060 12gb here actually, and it will only get easier.
>>
>>103490564
Regulation can't happen because we have the first amendment here, but what can and should be regulated is aggressive prosecution of people who distribute their illegal outputs.
>>
>>103490550
>regulations = china wins

People would die than let people accusing them being of some kind of -ist so this is going to happen anyway.
>>
>>103490568
people in /lmg/ were coping hard that anything lower than 70b would ever reach a level of usability and now the thread has psyopped itself into basically every model being unusable, even mistral large kek
the level of mental gymnastics people will do to justify fleeting stages of tech advancement is insane. doesnt matter if one uses an old gpu with 8gb of vram or the latest paypig njudea at 24gb, there's some level of weird elitism and crabbucket mentality going on that'll continue even when we're fucking waifus with advanced every-sight real time AI in VR.
>>
>>103490581
>3 second videos in 30 minutes
Okay buddy whatever you say
>>
What GPU specs are you people running this stuff on? Am i going to have any chance of succeeding with an RX 7800?
>>
>>103490596
>AMD
i'm so sorry man.
>>
>>103490587
>6-12 months between marginal advancements, going to the moon boys in 2 more weeks
>>
File: HunyuanVideo_00148.mp4 (564 KB, 1200x680)
564 KB
564 KB MP4
>>
>>103490581
But yeah to add to this, it's so funny we go over the routine a million times and people sitll don't learn.

>This thing will happen.
>No it never will lol you are dumb
>It happens soon after
>lol it can only do 3 seconds! and well next thing wont ever happen because current technology blah blah

Rinse and repeat. The world changes, it doesn't just stay like the present right
>>
>>103487099
why does it look like the one on the right is saying "your face on my dick"?
>>
>>103490612
and of course the cope;
>y-yeah it's advancing but its actually only marginal
>y-yeah you're all just caught up in le 2 more weeks >>103490604
i don't get it, why even have a mindset like this? what does it accomplish besides making yourself angry all the time?
>>
>>103490591
5 to 10 mins depening on res, which is way better than nothing at all, and is only the beginning. Thinking we won't be able to do longer vids in a few years is pretty dumb anon lol
>>
File: HunyuanVideo_00228.mp4 (276 KB, 512x320)
276 KB
276 KB MP4
>Drumrolls
EVERYBODY

GATHER ROUND!

I PRESENT TO YOU, GOOD GOOD PEOPLE

MY FIRST HYVID LORA
>>
>>103490628
Holy shit kek
>>
>>103490628
madlad
>>
>>103490623
yeah I've seen it so many times so had to rant finally lol
>>
>>103490612
>Flux came out 4 months ago
>still untrainable

>>103490623
Because I'm realistic and you're deluded. At the rate of progression we might have 60 second videos in 5 years.
>>
>>103490587
yeah it's like deja vu every time lol
>>
>>103490618
i was wondering the other day about someone using a lipreading model to make these generated hoes speak
>>103490637
>>Flux came out 4 months ago
>>still untrainable
ong where are the dedistilled tunes
>>
>>103490628
>enters your A.I thread uninvited
>takes over
how does he do it? great job on the lora that's insane.
>>
Any progress with local model voice cloning and generation?
>>
>>103490637
Flux training is bad I agree
>>
File: HunyuanVideo_00150.mp4 (535 KB, 1200x680)
535 KB
535 KB MP4
>>
>>103490642
>the entire progress of local LLMs is entirely based on the grace of Meta
>>
>>103490654
nope. sota 6 months ago is still king
>>
File: HunyuanVideo_00229.mp4 (294 KB, 512x320)
294 KB
294 KB MP4
>>103490649
Thanks. It's about as simple and receptive as Flux is. Good things are coming...
>>
>>103490628
would be hilarious if he stole your moving picture
>>
>>103490671
game over man, it's game over
>>
>>103490676
You're replying to the man himself
>>
>>103490676
It would be hilarious if someone posted all this to reddit and the settings they used before he could do it himself and steal his moment.
>>
>>103490654
How is it voice AI was some of the first, and it's gotten incredibly good too, but there are no local models for it. Theres so many sites that have whole libraries of cloning
>>
File: HunhuyanVideo_00139.mp4 (1.95 MB, 768x432)
1.95 MB
1.95 MB MP4
>>103490581
what resolution are you using anon? I'm struggling to get decent quality unless i over bake in another sampler second pass same seed even on 1 cfg.
>>
File: HunyuanVideo_00151.mp4 (209 KB, 960x544)
209 KB
209 KB MP4
>>
File: 1724403180079042.gif (316 KB, 450x359)
316 KB
316 KB GIF
>>103487099
BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA BOOBA
>>
File: this kills the crabs.png (1.96 MB, 1080x1576)
1.96 MB
1.96 MB PNG
>>103490671
that is seriously baffling, is this the first LORA trained in this thread so far?
of course it had to be of this fucking guy, naturally.

>big question is are you gonna go ahead with the bogdanoff lora now?
>>
File: kaiba-clone.mp4 (311 KB, 640x470)
311 KB
311 KB MP4
Again, what we need is (Text|2D) to (3D model) model and
Text to Animation file (classic time series data to apply to 3D) model,
and Yes, we will get to share the generated module files.
Eventually only the best single or a few gen may survive.
Then we will share only the list of seeds.
Either way, at some stage the cost and need for generation will rapidly decline.
That is the future I predict and hope for, sir.
>>
File: HunyuanVideo_00230.mp4 (151 KB, 512x320)
151 KB
151 KB MP4
>>103490697
>big question is are you gonna go ahead with the bogdanoff lora now?

Yeah for sure, I got the dataset ready to go. I gotta actually use my GPU for work now so when that's done I'll serve up a bog LoRA for everyone to try.
>>
File: 305b2mnbfzuc1.jpg (18 KB, 303x326)
18 KB
18 KB JPG
>>
>>103490671
make him suk a dik lol
>>
>>103490706
Kek, that is literally us except using this primitive video-only technology.
>>
>>103490705
Trained off just images? I can't imagine there's videos of this guy.
Did you caption just like flux?
>>
>>103490716
30 images, no captions.
Honestly dead simple. Props to the guys who wrote the training script and curse you for making me use wsl2
>>
>>103490690
>>103490669
Guess I'll check in in another 6 months, keep up the grind anons I expect flawless minute long anime porn gens by my next visit.
>>
>>103490711
lol i might try those exact words as a prompt to see what happens, for scientific reasons of course.
>>
>>103490671
Are you prompting the videos to be black-and-white, or do they do that on their own?

I ask because like 5-10% of the videos with my loras I've trained will be black and white for no reason. I have no idea why.
>>
>>103490705
absolute fucking legend, i look forward to the boggdening
>not like i was getting sleep anyawy
>>
File: HunyuanVideo_00154.mp4 (447 KB, 1104x632)
447 KB
447 KB MP4
>>103490705
can you try prompting two different characters(your lora + donald trump) to see if the lora overfits the prompt? pretty please
>>
>>103490723
that easy? holy shit.
>>
>>103490742
I just thought it would be funny for a noir style reveal and have it be that guy. It generalized really well.
>>
>>103490765
Ok. Maybe it's the fact I'm mainly using CFG for inference as I find that gives better prompt adherence wrt the lora. Or maybe I just fried it a bit somehow.
>>
>>103490776
I have no idea desu. I only had time to do a few gens before I really needed to get my GPU back for work. I look forward to trying it with some bogs and anime later and I'll report back if I get any random monochrome scenes.

But the ease with which this generalizes is big.
>>
>>103490683
I can tell this a lie, because there is no mention of their patreon
>>
Too clarify, I chose the grifter as the first LoRA I'd train because nobody else really seemed to understand the potential. It was kind of a way of punishing you for not paying attention.
>>
>>103490795
>But the ease with which this generalizes is big.
True. I won't say exactly what I trained, but its ability to generalize to video from images alone blew my mind. It's one of the few times something has truly felt like magic since I started using all this genAI stuff back in early 2023.
>>
>>103490610
Are you that same guy who writes extremely schizo prompts like
>Establishment close up shot of a mirthful hyperborean forgiveness goddess with comically gigantic pectoral proportions in a sterile, liminal space. Glamour reveal of her Natural beauty. Vertigo zoom effect. Blonde hair with bangs. Direct eye contact. She wears a thin and wet black microkini. sHe does stationary jumping jacks in the space with fully stationary arms. Liminal. hypergamous realism. feminine figure of empowerment. in the style of a hypergamous hyper-realistic exercise music video. Directed by Russ Meyer for Loaded magazine. Q-fast paced edite. Her movements are unrestricted both voluntarily and involuntarily.
>>
>>103490758
how many steps are you doing?
>>
>>103490849
kek is this real? what unholy gen did this pop out?
>>
>>103490628
i see a collage in this videos future
>>
File: HunyuanVideo_00037.mp4 (402 KB, 640x400)
402 KB
402 KB MP4
Trying to make news reporters, I wanted the wind to blow her clothes off, no success.
>>
File: HunyuanVideo_00060.webm (323 KB, 352x640)
323 KB
323 KB WEBM
>>103490881
It doesn't get remarkable results in Hunyuan, but it gave some pretty bizarre results when plugged into https://hailuoai.video/
>>
>>103490928
Instead of typing "A woman walks down the street, then the wind blows her clothes off" you could just type "A naked woman walks down the street."
>>
File: HunyuanVideo_00038.mp4 (226 KB, 640x400)
226 KB
226 KB MP4
>>103490692
Right now I'm using this resolution at 81 frames, which takes about 11 mins, doesn't look amazing because I think this model was trained to work best at 720p.

I use bnb_nf4 setting for the text loader, and using 20 double block swaps with 0 single block swaps for the block swapper thing.

I can go higher res and frames, but it risks going out of memory randomly and takes forever lol, at lower res I can get videos in 2 to 6 mins but obviously doesn't look as good.

>>103490934
But that ruins the fun of the wind blowing her clothes off!
>>
>>103490973
I'm up to 768x432 97 frames but I do have to use a lot of block swapping like 20/15 and vae tiling at 64/128 if I try to up the resolution any more or increase block swapping my computer will just lock up eventually... anyway sending the samples from the first sampler into another sampler you can get better quality doing a second pass same seed, just got to experiment with flow_shift and cfg more on second sampler as not to alter the result too much, anyway just try and you will see what i mean.
>>
>>103490973
put her in a wind tunnel and see what happens
>>
File: Hunyuaerweerwe (3).mp4 (1.34 MB, 960x544)
1.34 MB
1.34 MB MP4
check out this Rapeman intro I made:

https://files.catbox.moe/mc3nqd.mp4

Lyrics:
Rapeman! The crimson savior,
A warrior born to face behavior.
Through the smoke and fire, he stands tall,
A hero answering the city’s call.
Rapeman! The primal force,
Breaking through with relentless intercourse.
No face, no name, just justice's flame,
Rapeman, they’ll remember your name!

Every scar, every fight,
Every rape in the dead of night.
He bears it all, he feels their anal pain,
But through the darkness, he’ll remain.
>>
>>103490705
if possible please writeup your process in a rentry anon! 30 images no caption -> lora of a character seems too good to be true but your videos are proof that it works and many anons will follow in your steps if you write down what you did
>>
>>103490973
>>103491003
any how, you can read about all that from another guys experiments here https://github.com/kijai/ComfyUI-HunyuanVideoWrapper/issues/75
>>
I don't like Rapeman
>>
>>103491018
FUUCKK YYEEAAHH the RAPEMAN season 1 anime opening lets GO
>>
Currently in the middle of a nightmare trying to install deepspeed. I just want to make a fun video lora. It's always some bullshit esoteric python package that gives me problem
>>
>>103491024
I'm kind of interested in experimenting with just creating random seed travel frames from a pony model into a video combine and then feeding that into Hunyuan to see what happens, the old vid2vid way that is. If all it needed for lora was still images, who knows.
>>
>>103491024
I'll do a writeup when I get home. I've set my bogdanoff dataset up to run while I ride home.

It's a bit of an unusual dataset though. It's set of tiled 1024x1024 images that I'm hoping the bucketing will cut into 4 512x512 images. It worked well for flux so we'll see if it works here too.
>>
how about... rapegirl
>>
imo a rapeman gen should have a 1girl and rapeman bursts into the video at the very last second
>>
>>103491094
+1 on the comedic timing idea
>>
>still thinking anyone can control their videos
>>
>>103491155
I will be the first.
>>
n slur
>>
>>103491046
im not interested in that because i dont really understand why that would make anything interesting, so you should do it anon because i always think that someone should take advantage of the unique circumstances they're in to do things others cannot
>>
>>103490101
>ai will replace artists
>>
File: HunyuanVideo_00674.mp4 (105 KB, 448x256)
105 KB
105 KB MP4
I'll get it right eventually
>>
>>103489049
I've been using it on Linux ever since it came out.
>>
>>103491402
>>103491402
new
>>103491402
>>103491402
>>
>>103490351
oh shit, Hunyuan knows Taylor? fucking nice
>>
>>103490628
HOLY FUCKKKKKKKKKKK, and like you managed to get this with just images training? that's craaaaaaaazy
>>
>>103491003
Interesting, I'll have to try that out then.
>>
unintended MILF boobs. obviously NSFW,

https://files.catbox.moe/dfsitg.mp4
>>
>>103491418
Yeah loads of people film themselves in cars for some reason lol



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.