[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: HunyuanVideo_00002.mp4 (514 KB, 768x432)
514 KB
514 KB MP4
Discussion of Free and Open-Source Diffusion models.

7 minutes is too long to wait for a single video gen Edition

Previous: >>103513104

>UI
Metastable: https://metastable.studio
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI

>Models, LoRAs, & Upscalers
https://civitai.com
https://tensor.art/
https://openmodeldb.info

>Training
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>HunyuanVideo
Comfy: https://github.com/kijai/ComfyUI-HunyuanVideoWrapper/
Windows: https://rentry.org/crhcqq54
Training: https://github.com/tdrussell/diffusion-pipe

>Flux
Forge Guide: https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1050
ComfyUI Guide: https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Misc
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Generate Prompt from Image: https://huggingface.co/spaces/fancyfeast/joy-caption-alpha-two
Archived: https://rentry.org/sdg-link
Samplers: https://stable-diffusion-art.com/samplers/
Open-Source Digital Art Software: https://krita.org/en/
Txt2Img Plugin: https://kritaaidiffusion.com/
Collagebaker: https://www.befunky.com/create/collage/
Video Collagebaker: https://kdenlive.org/en/

>Neighbo(u)rs
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai

>Texting Neighbo(u)r
>>>/g/lmg
>>
Blessed thread of frenship
>>
https://civitai.com/posts/10256589
Holy shit...
>>
>>103519758
fuck, the workflow is only about the sound part, not on how he made those videos in the first place
>>
>>103519758
well it did better than i thought it would.
>>
If there was a button that deleted all video tech I would press it
>>
>>103519730

yeah it's the same censorship too, so not the pony of this model yet
>>
>>103519786
good thing you don't have that power Stalin
>>
ok so other than block swapping what are the big low vram optimizations I need to install to get my frames up?
>>
>>103519796
You need to install a new gpu.
>>
File: HunyuanVideo_00008.mp4 (544 KB, 512x512)
544 KB
544 KB MP4
>>
>>103519758
https://civitai.com/images/45265861
this one actually looks good and not like some girl having epilepsy
>>
>>103519796
it's the biggest one, rest doesn't matter
just be very patient
>>
>>103519807
What did you prompt for that?
I tried to make it generate a vr video but had no luck.
>>
>>103519668
where collage
>>
>>103519811
>just be very patient
My gen times are fast. I don't need speed, I need memory tricks.
>>
>>103519796
Install more vram https://youtu.be/14jzlR4yGCQ
>>
>>103519824
>video shot with a 180 degree fisheye lens
>>
File: 1729365460928641.png (237 KB, 1219x1399)
237 KB
237 KB PNG
>>103519808
>this one actually looks good
it's just a matter of luck, it's still too inconsistent, but the simple fact that it can do porn out of the box is already insane
>>
>>103519887
Man, just a couple of times is great, I hope he'll share the way he does it if it's not too schizo.
I'm not interested on the audio, though it's a nice bonus.
>>
>>103519887
the fun thing about porn being one of the most plentiful datasets with all positions, styles and ideas possibles, and yet here we are trying to get it to do simple penetration
>>
>>103519946
It's possible that it didn't train on that many videos of porn, but instead was more images of porn so it doesn't quite understand the movement yet
>>
so next SOTA will be a video, image, and audio model all in one, right?
>>
>>103519946
>we are trying to get it to do simple penetration
for a base model that's already impressive, what other base model can do porn? none, Flux doesn't know what a penis is, SD3 can't even lie a woman on the ground lol
>>
>>103519956
Oh for sure it's that, it doesn't seem to understand penetration (and most positions) by default.
So they probably avoided adding most actual videos of porn.
>>
>>103519967
img to video
more than 5s while staying coherent
actual sexual stuff for both 3d and 2d
>>
>>103519967
>next SOTA will be a video, image, and audio model all in one, right?
Hunyuan can do video, image and audio by itself
https://www.youtube.com/watch?v=6MISaOhNqmg
>>
File: 1731057586584939.mp4 (730 KB, 720x720)
730 KB
730 KB MP4
https://www.reddit.com/r/StableDiffusion/comments/1hedg7a/improving_hunyuanvideo_with_clip_finetune_factor/
That guys always sounds insane but it looks like he knows what he's doing
>>
>>103520061
As long as it's not that Turkish lunatic I'd take a look.
>>
>>103520061
>But, I think by giving my SAE-CLIP finetune the right OVERLORD influence (top right), the result is best (happy to have you disagree, if you do!).

tbqh his clip is the second worst
>>
>>103520061
>AI & I do prompt engineering towards prompt criticality.
can't take this guy seriously.
>>
>>103520125
I shouldn't either, but I like his clip finetunes, they are definitely better than base clip_l, still using smooth to this day
>>
>>103520061
His versions are the worst there
>>
>>103520136
>they are definitely better than base clip_l
I never switched to using it myself. From what I remember didn't look like much of an improvement so skipped over it.
>>
>
>>
So anyways
>>
>pull
>Get a cuda error during vae decode or the terminal just closes
I will never learn
>>
this is it
https://civitai.com/models/1038199/nsfw-hunyuan-lora?modelVersionId=1164548
>>
>>103520658
Would be it if he trained it on irl and not 2d stuff.
Hunyuan sucks at 2d.
>>
>>103520705
maybe it can work on irl stuff, I saw the workflow he uses "hentai" at the end of the prompt, you remove that and see if it makes it realistic
>>
File: 1733288261062942.png (36 KB, 276x292)
36 KB
36 KB PNG
>>103520658
Good. ACCELERATE. The lora era is HERE.
>>
>>103520658

What are the odds that we’ll be able to use both a motion Lora and a character Lora together? Asking for a friend of course.


Although you might not need both if we’ll get a coherent image to video setup.
>>
>>103520796
>What are the odds that we’ll be able to use both a motion Lora and a character Lora together?
I'd say good odds? stacking multiple loras has always worked on other models
>>
>>103520658
https://www.youtube.com/watch?v=Q929ZMezRvs
>Mr Bones - There are no brains
>>
File: unnamed.jpg (71 KB, 900x900)
71 KB
71 KB JPG
>>103520855
>brains
brakes, fuck my brain. There are no brakes on this train at all. Just awaiting the fucking normie melt down. N-NO YOU C-CANT JUST MAKE AI VIDEO PORN!!! HOW HORRIFYING SOCIETY WILL SURELY COLLAPSE, THE FUCKING SKY IS FALLING.
>>
>>103520885
>Just awaiting the fucking normie melt down
same, where's the meltdown on hunyuan? this shit is as uncensored as it gets and no luddite seems to give a fuck, what's happening?
>>
>>103520895
ahahaha, i remember those idiots saying we would never have something like this back in the summer and i said "I give it till the end of the year and you will eat those words" I am an 80's kid and I've seen before how fast new tech moves once it gains interest.
>>
File: 1704374245029521.png (411 KB, 959x949)
411 KB
411 KB PNG
>>103520920
>i dont think anyone really knows about it desu

I highly doubt that, it's third on the huggingface popularity list right now
>>
>>103520934
how do you think the normies get the AI info from? I guess from the media, but the media knows how to look at the new hot thing, they know about hunyuan there's no way they don't
>>
>>103520915
Same I remember saying the same to them and pointing out how things always chnage and how everyone always acts like everything stays the same, yet they laughed at the idea of local video lol
>>
in the last 48 hours we just blew everything apart, down with the likes of pornhub and its scummy industry that created the problem, we are the fucking solution and the fucking absolute. We will crush that entire industry for exploiting us. Pretty soon everyone will be bored of porn because they can get their instant fix anytime they want all for free.

Just you wait they will be kicking off about us anons :-)
>>
it's getting really schizo in here
>>
>>103520979
why?
>>
>>103520895
>this shit is as uncensored as it gets and no luddite seems to give a fuck
I hope it'll stay that way, so that it could give some balls to the other AI companies like SAI and show to them that it's ok to release uncensored models
>>
>>103520966
mate i've some ideas in my head to make an even better video model since using this thing and coming to realize how it works. A one that would work way fast and taking a much different approach, using only stable diffusion models like SDXL and a bit of math to work out differences between images and then using a sorting algo. This truly is the beginning, those normies better buckle up.
>>
https://www.youtube.com/watch?v=n4RjJKxsamQ
>>
>>103521002
Sounds interesting :o, and yeah new ideas and techniques will always come too, so if one thing gets to a dead end and the door closes, another door always opens.

People often calculate the future based on what we have today rather than what we might have in the future haha
>>
>>103520658
give who done this your buzz
>>
File: HunyuanVideo_00342.mp4 (139 KB, 640x400)
139 KB
139 KB MP4
>>
>>103520915
ok boomer
>>
>>103521042
true, it's weird when everything is going smoothly, it shouldn't be
>>
>>103521038
if we use 2 reference frames, one is the start frame and the other is the end frame, then we gen a lot of images from one prompt and then do some image difference math, then sort them in logical order, then we have our flow right? Ah still thinking about it, i dream in stable diffusion FFS...
>>
>>103521038
and yeah tagger models are getting rather good, it wouldn't take much, but would it be faster? I don't know, but with the likes of sdxl and pony we have lcm which is very fast. I have a pony lcm weights lora, it was the first ever its fairly recent it just needs to be used at a low weight like 0.2 and no negative and cfg 1 and its really fast.
>>
>>103520986
We're accelerating too fast, so the event horizon looks schizo. It's crazy because yesterday morning we had static cowgirl sex with no movement. Then, cowgirl with movement but fucking demonic spasms. Then a gen last night so peak that people were begging the anon for his workflow. Yet it's already obsoleted by new civitai loras. And these are literally alpha loras on an alpha lora training code on kijai's experimental (and constantly breaking) node for a base model that hasn't even finished development yet (they promised MLLM, i2v, more vram/gpu splitting improvements, etc etc etc). Oh, and we're using fp8 not a proper Q8 gguf quant.
>>
>>103521133
>We're accelerating too fast
>a base model that hasn't even finished development yet (they promised MLLM, i2v, more vram/gpu splitting improvements, etc etc etc). Oh, and we're using fp8 not a proper Q8 gguf quant.
that alone shows it's not accelerating fast enough, I to want a Q8 quant and MLLM, that alone will make the model much better overall
>>
>>103521133
The schizo, ladies and gentlemen.
>>
>>103520658
>>103520705
>Would be it if he trained it on irl and not 2d stuff.
it works well on irl stuff if you decrease its strength to something like 0.7 or 0.5
>>
>>103521175
might work, but might still feel like an animation made in blender hmmm
>>
File: HunyuanVideo_00016.mp4 (601 KB, 832x624)
601 KB
601 KB MP4
How do trains work
>>
>>103521187
well, he trained on 3d renders so yeah those are blender animations
>>
File: 1656737202117.jpg (61 KB, 640x640)
61 KB
61 KB JPG
>>103520973
Nah, reddit, pornhub and the rest of them won big time. They managed to kill off the real amateur porn by nuking it all from orbit. Thousands of terabytes of the genuine stuff from 00s and early 10s, all gone. Replaced by the soulless shit from onlyfans and the others. Horny amateur sluts who were filmed for fun and gained absolutely nothing from it lost to the greedy professional whores, pissing themselves with dead eyes to justify your $5 subscription. I hate this. There's nothing good left out there to train an amateur video lora to recreate those feelings.
>>
>>103521153
Mllm ?
>>
>>103521208
The new generation by default has more of that dead eye look for some reason, something about social media gives them that boring expression.

(of course not of everyone. bu it's something I've noticed)
>>
>>103521133
>begging the anon for his workflow. Yet it's already obsoleted by new civitai loras
aye but i've work on it hard to redeem myself, its being tested, it produces enough to feed the machine and with the lora on top it should be pretty fucking insane. but wait out lad, the future is rewriting the past right now that is how schizo crazy this shit is, this could be really it.
>>
>>103521222
We're not using the official text encoder, the official one is called HunyuanMLLM and they said they haven't released it yet, so for the moment we're actually playing with a duck tape
https://github.com/Tencent/HunyuanVideo/blob/main/ckpts/README.md#download-text-encoder
>>
File: HunyuanVideo_00345.mp4 (198 KB, 640x400)
198 KB
198 KB MP4
>>
>>103521250
We don't know how big a different the official MMLM is over the one we have now.
>>
>>103521265
it's probably a finetune of llama-llava-8b, it can't be too different, for example joycon is also a finetune of L-L-8b and when you plug on hunyuan it gives you nonsensical outputs
>>
>>103521153
its moving rapidly in ways you could not comprehend, did you ever think we are already living inside of it now? Is that air you are breathing right now anon?
>>
>>103521283
I always knew, I just wonder who the real me is behind all this.
>>
>>103521303
one day I just imagine everything will burst into a surreal experience and the sky will open and we will unite into oneness that we will remember. That would be the singularity which is moving rapidly always backwards unraveling the past. What do you think this "car size drones" are? Iranian mothership who the fuck writes this shit kek? A media or a system that is scared of losing control.

project looking glass ring a bell? They knew it was coming and there was no way to avoid it.
>>
>>103521337
There is a connection here, I was actually watching a video about the drones while reading this reply to me....
>>
we local degen general now
>>
File: HunyuanVideo_00049.webm (311 KB, 960x544)
311 KB
311 KB WEBM
>>
File: HunyuanVideo_00346.mp4 (79 KB, 640x400)
79 KB
79 KB MP4
>>
>>103521247
Obsoleted is probably too harsh of a term since it's still very useful. Keep working on it
>>
Just so you know I all think less of your for jumping up and down like screeching baboons while ranting about the future being here because you saw a 3 second clip of a very wobbly penis going into a vagina.
>>
>>103521412
>I all think less of your
*you
>>
>>103521412
>Just so you know I all think less of your for jumping up and down...
Good morning sir
>>
>>103521405
firing the weapon now sir. first test of new method. I have the new lora ammo sitting on the platform, it will be loaded next and then fired.
>>
>>103521412
Pretty soon it will be our wobbly penises going inside those vaginas. The future is now old man.
>>
>>103521412
Your english is terrible SAAR
>>
For anyone using Cubey's LoRA, I'm getting the best realistic results by combining with a photographic character LoRA and a prompt that emphasizes specific things like this:

> nsfwsks, a girl is having (missionary sex:1.2) with man out of frame, (his penis is going deep in and out of her pussy:1.3), (repeated motion:1.3), hentai, (ohwa person:1.6), blonde hair, she is (lying on a bed:1.2), she is moaning, the camera is stationary
>>
>>103521412
>Just so you know I all think less of your
How am I gonna recover from this :'(
>>
>>103521430
>Make one typo
>Immediately demoted to street shitter
Insanity.
>>
>>103521432
> (repeated motion:1.3), hentai, (ohwa person:1.6),
you'll get better results by removing the "hentai" token first lol
>>
File: HunyuanVideo_00347.mp4 (195 KB, 640x400)
195 KB
195 KB MP4
>>
>>103520930
It's not going to hit normies yet until someone like Sarkas or Aitrepeneur cover it in one of their tutorial videos.
>>
>>103521441
>one
One? Your whole sentense reeks of curry esl saar.
>>
>>103521448
Incorrect. It goes completely off the rails unless you include that since it's an important part of the training prompt.
>>
>>103521432
>(bob:1.2)
>(vageen:1.8)
These should not work with the current text encoder. I don't know what your reasoning is.
>>
>>103521412
it is a shame these generals have devolved into what they are now.
>>
>>103521470
>Incorrect.
correct, I got good results by removing "hentai" as long as it follows the workflow's settings (640x480x49f)
>>
File: HunyuanVideo_00018.mp4 (624 KB, 832x624)
624 KB
624 KB MP4
I didn't ask for this
>>
>>103521494
that looks cool though
>>
>>103521491
Would one of you fucks post a catbox or I don't believe either of you
>>
>>103521494
Gnarly
>>
>>103521515
NTA but there is no way in hell that (word:1.2) does shit in hyvid
>>
>>103521376
>braaaaaapsquuuueeeepppbrbrbrbr!
Not being funny that was the first thing when i read her expression.

Goodnight
>>
https://www.youtube.com/watch?v=1UUYjd2rjsE

<3
>>
>>103521580
>Scorpions
Thought /ldg/ were just fans of their one album cover, didn't know you liked the music too
>>
>>103521580
https://www.youtube.com/watch?v=oxZxe092eqo
>>
>>103521529
correct it does not understand that and also the negative cfg thing will oom your shit which is a real shame because it was useful. Oh well. Tbh i rarely use negative prompts these days anyway because they can negatively influence the image through restriction.
>>
>>103521580
>>103521629
I don't want to sound rude anon but you lost a lot of credibility by making a schizo workflow and at the same time assuming that going for denoise 1 wouldn't completly destroy the input video in the first place
>>
>>103521529
LMAO you clearly haven't tried it, prompt weighting works just fine in Cunnyun
>>
>>103521083
oh man I love death grips
>>
>>103521638
i mistake i know anon, its not the same as other samplers 1 denoise in hunyaun is not the same. It will replace what every frame with random noise at 1 and then denoise that i realized my mistake after sleep (I can be up 48 hours no joke) and i disclosed that i was wrong but i learn a few things from it. As soon as i knew my mistake i notified them anons in the porn thread, then someone kindly linked it in previous thread on /g/ and label it meme workflow. So that is why am very busy now to make something that actually works to redeem myself for that failure and disappointment.

In reality i know too much anon, and this anon will deliver soon.
>>
>>103521680
like i could put on first image at stage on an ipadater and an image loader for you to load your favorite and have the model make all frames of her face and body and style, but I know not to do that... It will be abused to fuck...
>>
File: 1709840383208939.mp4 (392 KB, 512x768)
392 KB
392 KB MP4
https://civitai.com/models/1038512/super-saiyan-hunyuan-video-lora?modelVersionId=1164936
I'm gonna have so much fun with those loras, and it's just the begining
>>
>>103521657
The burden of proof is on you.
>>
>>103521708
>0 downloads
Would have been more honest if you said "Hey I made a LoRA, try it out."
>>
>>>/aco/8643760
frame rate is a little slow, can fix that, but here you go anons.
>>
>>103521733
lol, I have no idea how to make those things, I'm just on my "f5 spam" phase on civitai, like I did during the first loras of flux
>>
>>103521740
>here you go anons
and like the day before you didn't share a workflow
>>
>>103521751
its coming man you fuck head relax jesus, why be a bitch i have to make sure its right and works god damn. Your nastyness will not prevent me post it so fuck off fed, neck your self
>>
>>103521740
ootl what is this? some kind of hacked together i2v?
>>
>>103521763
you wasted everyone's time yesterday with your broken workflow, I think you should lower your motherfucking tone down, you'll be allowed to talk like a big boi once you'll show a functioning workflow
>>
>>103521774
something special that was i2v interpolated then refined and send in as reference, second stage is wack though i will post it to show in a few.
>>
>>103521788
>>103521784
ahahah yeah what retards to think any one cares...
>>
>>103521740
and you think the one thing ai tech should be used on is the one thing the internet is overflown with: porn
ok
>>
>>103521724
nigger this isn't a court of law, i'm not the district attorney. try it or don't, but you're a smoothbrained promptlet if you don't use weighting with this thing
>>
just scranned some beef stew out of tin, no time for meals. i will drop workflow as is but i'm still working on it. I will probably remove the old concept group because its not good enough, this is better.
>>
or maybe i just fuck you and kept if for myself...
>>
>>103521742
No shame I'm in the same boat.
>>
I'n all honesty i could care a less about you dickheads your all stupid cunts. pic related. I don't any of you contributing, except the lora guys that gave us nice things, all you lot are braindamage
>>
>>103521937
yes, and bottom right unironically the best and he completely ignores it
>>
bye bye
>>
>>103521946
>drama queen
Ikr, he sounds like a chick, I won't be surprised he will troon out in a near future or something
>>
all these fags spending multiple thousands to generate a few seconds of videya that my consumer card will be able to do in a few months
lol !
>>
>>103522071
What are you implying?
>>
>>103522071
> consumer card
> 4gb vram
>>
File: HunyuanVideo_00356.mp4 (546 KB, 640x480)
546 KB
546 KB MP4
we truly post in a local diffusion thread
>>
Is it (Sentence prompt that I want noted.):1.2
Or (Sentence prompt that I want noted.:1.2)
In comfyui with NTRMix and Illustrious models?
>>
>>103522291
Highlight the thing you want emphasized and press ctrl+ on your keyboard and it will do it in the correct format for you.
>>
>>103522357
Ctrl+ up
Fucking 4chan doesnt like the up arrow.
>>
>https://github.com/ai-forever/Kandinsky-4
and noone is talking about it. /ldg/ has fallen
>>
File: 1706291688181080.mp4 (3.06 MB, 1344x768)
3.06 MB
3.06 MB MP4
>>103522557
the video part is CogVideoX tier
>>
>>103522584
>10 seconds
this alone could mean something is there
>>
>>103522613
you can do 10 sec on hunyuan, if you have enough vram
>>
>>103522584

Why does the cat look so nervous?
>>
>>103522621
>if you have enough vram
this is the part that's shitty
>>
File: HunyuanVideo_00068.webm (1.6 MB, 960x960)
1.6 MB
1.6 MB WEBM
>>103522584
just like how llama-3-405b made mistral release mistral large, i feel like hunyuan made these guys release
>>
>>103522706
tummy
>>
>>103522584
why is there only one example of a human and it's just a headshot of a motorcyclist wearing a helmet. Can it do people?
>>
>>103520658
Cant seem to get a good gen using its keyword and something like girl having sex in cowgirl position. Any tips?
>>
>>103522755
uhhh sweaty you should be prompting older woman or mature not girl.
>>
>>103521494
GLORY TO THE PISSBIRD
>>
>>103521494
PRAY TO THE FALCON! RECEIVE GOLD DUST!
>>
I'm seeing that people with 3060s are running HunyuanVideo. Is there a guide for how to get this running on linux? How much RAM do you need if you've only got a 12gb vram?
>>
Is there a reason that the prompt is truncated to 256 tokens when the Llava model is supposed to handle up to 8192?
>>
>>103522973
unknown
>>
>>103523283
Don't fucking reply to me, pedo.
>>
>>103522912
I'm pretty close to the limit with 32gb, I could squeeze more out of it with block swapping if I had more, but it would be so fucking slow that I doubt it would be worth it
>>
File: me and the boys GOONs.jpg (268 KB, 2048x1535)
268 KB
268 KB JPG
>every time i go to sleep then wake up it ACCELERATES
>>
Working late tonight Agent Gonzalez?
>>
>>103523340
so you're saying there's no point in trying with a 3060?
>>
>>103523525
if you have one then what are you doing talking, go for it
if you don't then I wouldn't recommend getting one for hunyuan
>>
>125 seconds per step
>450 steps (for a trial)
that's... oh fuck.
but hey, it's working
>>
File: HunyuanVideo_00153.mp4 (1.16 MB, 960x544)
1.16 MB
1.16 MB MP4
Seems like Hunyuan can handle 3 more seconds for total of 193 frames. But only at higher resolutions it seems. Not just skipping around, but a genuine smooth continuation of the previous frames. Catbox (nsfw) was the first gen with 10 steps flow=17, this is the 30step flow=7 version that is somehow no longer nude lol so I can post it directly. If you try this with lower resolution you get nightmare fuel.
https://files.catbox.moe/1agjhn.mp4
>>
>>103523416
>this post got deleted too
was he samefagging and calling himself based?
>>
>>103520796
once we get image-to-view character video loras will become largely irrelevant.
>>
>>103523442
He was replying to himself saying shit like "your settings are so good, post them! I'm also genning cunny" and "based pedo, you're the best" lmfao

Feels like something weird is going on just because this one's from China. Someone wants it all shut down.
>>
>>103523713
What are the other settings/your hardware? I was getting 110seconds ish for 960x544@195f with some offloading on a 3090. We need to be able to split the vram between gpus so bad.
>>
File: 00011-2861429401.png (1.26 MB, 920x1224)
1.26 MB
1.26 MB PNG
>>103523798
yeah we established that like 10 threads ago
its weird stuff cause its not like its subtle, and when enough anons point it out, they go nuclear and start randomly starting arguments with anons having normal conversations and doing the spiderman meme kek
lets just hope we can keep getting advancements and not end up with this shit suddenly shut down in 3 months or so.

>i cant share a single thing ive been genning all month except something like picrel since its all explicit nsfw
>>
>>103523820
Why didn't you make a video out of this?
>>
>>103523809
I'm training a lora, I didn't manage to get this far yesterday and I'm probably gonna kill it after the first checkpoint
>>
>>103523843
>gtx 1080 ti
>>
>>103521412
>because you saw a 3 second clip of a very wobbly penis going into a vagina.
this is like Neil Armstrong stepping on the moon
>>
>>103521412
>3 second clip
We've already have 8 seconds of coherent video confirmed. Possibly 10 seconds. And this is just the beginning.
>>
>>103523867
Neil Armstrong landed on the moon and then nothing much newer happened after that. Not a very auspicious example.
>>
File: HunyuanVideo_00363.mp4 (518 KB, 640x480)
518 KB
518 KB MP4
>Omg more porn!
>>
>>103523820
I wonder if we'll see another round of agitation about AI dangers/etc.
>>
File: chatlog (44).png (484 KB, 830x1467)
484 KB
484 KB PNG
3.33 Eva seems fun.
Using this atm: https://files.catbox.moe/3vr6k0.json
>>
>>103524116
>(Lick.)
on par with picrel
>>
>>103524116
Oh, and I'm playing with just using some tfs instead of min p. Trying to find a balance of fun but smart even on contextless dumb stuff like this
>>
>>103524126
Meh, Im just trying to see how it writes / acts. Swear I've got actual quality shit elsewhere.
>>
>>103524116
ew
>>
>>103524116
>>103524126
>>103524131
>>103524138
Wrong thread
>>
>>103524147
Recommend some normie card. Chub is full of hot garbage even if I sort by likes or whatever.
>>
>>103524147
i don't care *gives you a wedgie*
>>
File: EgEG_dZXkAE9BjT.jpg (63 KB, 1080x776)
63 KB
63 KB JPG
>>103523864
>>
>>103523962
Sex and war is the driving force of humanity. It's all these models will truly be made for, everything else is just derivative.
>>
what's the skinny on local model (possibly diffusion? I dunno) trellis
>>
>>103524168
yeah but at least i can still gen in sdxl fine enough
at least.
>>
>>103524116
Based horse enjoyer.
>>
File: HunyuanVideo_00367.mp4 (423 KB, 640x480)
423 KB
423 KB MP4
>He goons to 3 second clips he spent 8 minutes generating
>>
File: HunyuanVideo_00368.mp4 (612 KB, 640x480)
612 KB
612 KB MP4
So hot
>>
File: HunyuanVideo_00370.mp4 (162 KB, 640x480)
162 KB
162 KB MP4
>>
>>103524506
More like 20 minutes. Can't wait until someone makes a Hunyuan Turbo XL of this garbage.
>>
File: HunyuanVideo_00371.mp4 (85 KB, 640x480)
85 KB
85 KB MP4
>>
>>103524788
>Turbo
why would you want to further lobotomize something that's already very SOTA and WIP?
>>
>>103521250
Thank you king!
>>
File: HunyuanVideo_00372.mp4 (343 KB, 640x480)
343 KB
343 KB MP4
>>
>>103524865
So bootiful :''(
>>
File: HunyuanVideo_00373.mp4 (373 KB, 640x480)
373 KB
373 KB MP4
>>
>>103524506
>>103524746
Does anyone by any chance still have the Lora for the Bogs?
These are so bloody good, can't stop fucking laughing.
>>
>>103524928
https://civitai.com/models/1035770?modelVersionId=1166218

There you go
>>
File: HunyuanVideo_00374.mp4 (296 KB, 640x480)
296 KB
296 KB MP4
>>
>>103524972
What a fucking legend, thanks king.
Something I'll look forward to when I get hang of comfyui more.
>>
File: HunyuanVideo_00375.mp4 (226 KB, 640x480)
226 KB
226 KB MP4
>>
File: HunyuanVideo_00376.mp4 (235 KB, 640x480)
235 KB
235 KB MP4
>>
File: 1704583506586124.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
question, noobAI vs pony, nai seems to work well with just booru tags, but is it better at this point?

seems to do backgrounds nice, this is without a rei lora for example.
>>
>>103525154
Yes.
>>
>>103525161
whats the diff between vpred and eps models? all i've read so far is eps is better for loras, apparently
>>
>>103525168
vpred supposedly has better contrast
>>
File: 1714232009806128.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>103525176
this is the latest vpred one (still learning settings/etc). so far im impressed, this is just a generic 1girl, frieren tag, classroom prompt. it's nice that there are good results even before character/style loras.
>>
File: 1713115188145209.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>103525186
same prompt except suzumiya haruhi:
>>
File: 1707784988725171.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>103525192
one more, just with mari booru tag

less reliance on loras is nice BUT you still have the option to use character loras, or style loras. but artist styles work too, its a neat model.
>>
File: 1713075154701984.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
bulma waving hello, no lora

shirt is easily fixed with inpainting with flux fill or PS, still neat
>>
>>103523727
how much time to generate that?
why is you flow so high?
>>
>>103523978
it never went away, every few days some journalist finds a new angle to to fuel the panic
>>
File: 1724691137364601.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
I like the aesthetics but more importantly the model can do good backgrounds, that's my main gripe with pony based checkpoints. even without loras I like NAI so far.
>>
How do you get less realistic more "perfect" skin? It's giving me blemishes instead of smoothed photoshoped perfection in higher steps in the videos I'm genning.
>>
>>103525258
your shit must be cursed because basically every video shown in this thread and on that porn general have girls, even the guys, with perfect skin
you should show the video and your settings so people can help
>>
File: 1734111074417097.png (3.43 MB, 2048x1124)
3.43 MB
3.43 MB PNG
https://www.reddit.com/r/StableDiffusion/comments/1hen24r/comfyui_fluxmod_run_flux_with_88b_parameters/
https://github.com/lodestone-rock/ComfyUI_FluxMod
>A modulation layer addon for Flux that reduces model size to 8.8B parameters without significant quality loss.
Interesting technique, if this can be used on Hunyuan we could be eating really good
>>
File: 1714264843148137.png (72 KB, 1331x481)
72 KB
72 KB PNG
>>103522557
>and noone is talking about it.
becaus they only released the bad version lol
>>
File: HunyuanVideo_00377.mp4 (292 KB, 640x480)
292 KB
292 KB MP4
>>
>>103525121
>>103525479
Would you share the settings you used to train? Specially the amount of pics.
>>
>>103525479
>You're a wizard, Igor
>>
>>103525494
28 images of Igor and the other one. They were basically together in all the images.
Tagged with joycaption but I don't know how necessary the tags even were.
1100 steps for 40 epochs.
Rank 32, but I think it would work just as well at 16tbh
My previous one was at 600 steps and it also functioned fairly well but tended to produce partially bogged subjects than fully bogged ones.
>>
>>103525520
>They were basically together in all the images.

damn maybe that's why the lora is so fucked, might just try to crop igor out of every image and train on the crops. Could even use SDXL to touch it up slightly to a proper resolution even.
>>
File: 1703539458365066.jpg (1.69 MB, 1248x1824)
1.69 MB
1.69 MB JPG
>>103525176
Vpred's contrast is too high, and it generally seems to overcook the image to shit and back. This is epsilon11
>>
>>103525526
Fucked? I think it does what it was supposed to. Bog people.
>>
File: 1705393089939422.jpg (1.49 MB, 1248x1824)
1.49 MB
1.49 MB JPG
>>103525536
...and this is vpred with the same prompt and seed.
I dunno, maybe it needs a different config or something
>>
File: 1734199045350446.mp4 (440 KB, 640x400)
440 KB
440 KB MP4
>>103525537
>I think it does what it was supposed to. Bog people.
well you certainly got me there
just figured it was still a WIP because you never re-recreated that one dracula gen since the initial gen was a failure with bogman looking like a still cutout
those were hilarious by the way, the initial lora attempt just having them stand around like cardboard cutouts really awkward and uncanny.
>>
>>103525571
The first one was seriously overtrained on a dataset that was basically 4 4x4 grids of bog faces. It was never gonna work. Maybe I can get a Dracula out of this one, let me try.
>>
File: 1703731393285278.png (938 KB, 1024x1024)
938 KB
938 KB PNG
yeah, i'm figuring out stuff like cfg and using the advised positive/negative prompts, but I can see the strengths of noobAI and illustrious right now.

latest vpred (0.9) model:
>>
File: HunyuanVideo_00378.mp4 (396 KB, 640x480)
396 KB
396 KB MP4
>>
File: 1721774987986701.png (763 KB, 1024x1024)
763 KB
763 KB PNG
>>103525608
compared to pony, it seems to have better colors/shading, and NAI can do backgrounds (ponys main flaw, even if you can edit it later with a white background)
>>
https://github.com/kijai/ComfyUI-HunyuanVideoWrapper/pull/149

Thoughts?
>>
>pony
Any "insider" info about the progress? Did he decide to drop the idea? It's 2025 in two weeks, v7 or v6.9 whatever should have been released in Summer to remain on top of the game.
>>
>>103525520
how long did that take
I haven't really fucked with wsl settings at all, but I was getting 125 seconds per iteration and cancelled it after the first epoch
>>
File: 1714389001454698.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>103525616
but it's amazing how well it works with generic booru tags. use this extension with it:

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

then get style loras or characters if you really need an obscure one. still, very impressed just with very little use of it. Using the default positive/negative prompts (like score_9 but for illustrious)

Prompt Prefix:

masterpiece, best quality, newest, absurdres, highres,

Negative Prompt:

worst quality, old, early, low quality, lowres, signature, username, logo, bad hands, mutated hands, mammal, anthro, furry, ambiguous form, feral, semi-anthro

just removed the nsfw negative prompt cause thats for me to decide. I love the colors, though.
>>
>>103525645
it's a meme, using a visual model to get the prompt from an image won't get you something even close to what you're trying to achieve with i2v
>>
File: 1708607688802352.jpg (1.36 MB, 2016x1152)
1.36 MB
1.36 MB JPG
>>103525654
>fantastic model
>doesn't recognize unicorn overlord characters
So close yet so far
>>
File: HunyuanVideo_00379.mp4 (318 KB, 640x480)
318 KB
318 KB MP4
>>103525653
It should be like 6s/it on a 3090. Something is wrong.
This was like 2 hours of training.

>>103525669
I figured at much, any experiments I've done using clip embeddings as a prompt in other projects never work.
>>
File: 00009-224305787.png (2.24 MB, 1432x1432)
2.24 MB
2.24 MB PNG
>>103525645
potential bigtime happening? I hope you hunnyan boys are running over 20gb of vram kek

>>103525648
eternally btfo'd by illustrious, then its grave pissed on by noobai. It's over for ponyfag.
>someone please for the love of god make a ponyrealism equivalent for noob im dying here the chink bugs are the only ones doing it and they suck
>picrel genned in ponyrealism
>>
File: 1717231287891163.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>103525654
*also, adetailer and controlnets work fine with this too, all the usual extensions work fine. civitai helper too for your loras/styles/etc.
>>103525684
thats where the loras come in. still, amazing model: it even knew misaki from NHK from the booru prompt.

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

I didnt make this (really) but it's so good for adding prompts with the exact tags (what the dataset was trained with).
>>
>>103525701
only thing I dont know yet is whether cfg 4 or 5 is ideal, it just recommends a range. I am a noobAI noob, ironically
>>
>>103525689
I know that it got btfo'd, switched myself since 0.1, but I wonder if there was any official word from the ponyfag. Or any observable butthurt, anything.
>SDXL -> full anime/cartoon tune -> back to full realism tune pipeline
Unironic mental illness.
>>
File: 1721951481388691.jpg (243 KB, 763x601)
243 KB
243 KB JPG
>>103525701
Yeah i've been using tag autocomplete since 1.5, it's great.
Noob is also a great porn model. Very knowledgeable about interactions between two humans and with fantastic creativity.
>>
>>103525686
kek why does he look so confused, baffled, even terrified of the wine glass?
its like he was having an existential crisis and thinking to himself "..what IS a man?!"
>>103525722
>Unironic mental illness.
well how else do you expect to get the creativity and robustness of a 2d model in the form of 3D/realism? a realism lora? LMAO
>>
>>103525727
He's supposed to be "sneering" but I guess he was too bogged.
>>
File: 1714058947657847.png (676 KB, 1024x1024)
676 KB
676 KB PNG
>>103525725
I did a nude prompt (not of rei) to test. it's very good, surprisingly good nips. What I like most is how many characters work for prompts without loras, now I can use loras for styles mainly or obscure characters.

Is training illustrious based loras simple? it's SDXL based so does kohya work?
>>
>>103525742
>not of rei
That's good because if you did I'd have to report you to the FBI.
>>
>>103525727
I expect them to make an SDXL finetune in the same way pony/illustrious were made. By grabbing a ton of varied pics and following the recipe.
Yeah maybe you won't get something extremely exotic but no way the general quality won't be higher. Though I wonder what kind of concept you can't find in 3d, that you still could portray in 3d by genning.
>>
>>103525751
nah, I test that stuff with something appropriate: igawa asagi.

...which the model knows without a lora, it basically knows everyone. Although I couldn't get noko shikanoko to work but deer girl anime is new, can use a lora for that.
>>
What's with the "very awa" tag i see in many civitai noobai prompts? Some sort of quality tag?
>>
File: 1711055919094938.png (949 KB, 1024x1024)
949 KB
949 KB PNG
it even knows umu with no lora. progress!
>>103525775
I think it's like pony's score_9, it picks the very top results or the best results, it's like a masterpiece prompt. the civitai page for the model has a list of recommended prompts but oddly the awa one isnt there.
>>
>>103525787
>isnt there
It's on the HF page.
>>
>>103525753
this shit is nuts man, i do think you gotta be on the spectrum in some major form to be able to pull off any of this
which is why i hope next year i can jump on the train and do some of the work myself with a new system. I feel like we still haven't fully seen the potential of noob yet.
>>
File: 1723155345652501.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
no lora, yep noobAI/illustrious is the way. oddly enough the model wasnt working in my other webui (for pony/sdxl) but everything is fine on a fresh reforge install/git clone.

jack the ripper \(fate/apocrypha\), fate \(series\),masterpiece, ass, best quality, newest, absurdres, highres, very awa
>>
>>103525787
My nigga it knows even Fiorayne from Monster Hunter. Of course it knows mainstream characters.
>>
>>103525820
*im not sure if the awa prompt is necessary but it seems to work, so it may be a score_9 type of thing. it works, so ill just leave the basic prompts as default in ui-config.json.
>>
File: 1713356467375555.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
aesthetics aside, the main benefit is that it seems more like anime and less "fake", the colors are nicer, the lineart seems better, and most importantly it can do backgrounds. Pony was SHIT at backgrounds. Good for nsfw/poses, bad at backgrounds.
>>
Pony guy still going over licenses with his expensive lawyer trying to find the cheapest model to train on so he can scam for more while the rest of the world has moved on.
>>
I thought bfl had changed the flux license terms months ago
>>
>>103525846
Experiment with different artists too. They change the output tremendously.
>>
File: 1728194551088777.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>>103525846
also, controlnets seem better: here is a canny with saber as the prompt.

very good model, it's a definite step forward.
>>
>>103525884
*with pony I always had to change it to "prompt more important" to get it to work. this works fine even with the default balanced.
>>
File: HunyuanVideo_00382.mp4 (139 KB, 640x480)
139 KB
139 KB MP4
>>
File: 1729975746079544.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>103525884
same image but with camilla \(fire emblem\) instead, works well: and again, no lora was used!
>>
That's only one guy talking to himself, he always ends his sentence with a period "."
>>
>>103525900
what a time to be alive where we can generate bogs in any scenario.
>>
File: HunyuanVideo_00383.mp4 (175 KB, 640x480)
175 KB
175 KB MP4
>>103525915
Sometimes the bogs turn Chinese and I don't know why.
>>
>>103525952
with this and ai audio the possibilities are endless.

speaking of which, I can get perfect Trump audio with e2-f5-tts, it's amazing for natural sounding speech with a small sample.
>>
>>103525960
I think there are packages out there that let you puppet and reanimate faces. I forget the name. But you could easily have the moving clips speak in any way you want them to and it looks fairly seamless.
>>
>>103525915
>>103525960
>>103525970
>>103525952
see? he always ends with periods >>103525914
now I'm starting to think if this is a bot instead
>>
>>103525952
>Chinese
looks like Taiwanese.. taipei 101 in the background. God damn I miss those chicks.
>>
File: Untitled.png (21 KB, 522x201)
21 KB
21 KB PNG
>>103525973
You're a fucking schizo. You're supposed to end a sentence with a period.
Generating bogs and posting them while I do work stuff doesn't make me a bot, retard.
>>
File: HunyuanVideo_00384.mp4 (115 KB, 640x480)
115 KB
115 KB MP4
>>
>>103525983
>writing well on 4chan
lol? lmao even?
>>
>>103525983
kek dont respond to 'em boganon, it's probably the same fed fag that posts the pedo shit and tries to start the infighting.
i wonder what igor in tiananmen square would look like.. i wonder if it'd make him even more chinese than this >>103525952
>>
>lmao xd u use capitalization and full stops? liek ru a fggit or wat?
>>
>>103525997
>capitalization and full stops
yanked brit or a lime'd yank?
>>
File: HunyuanVideo_00385.mp4 (116 KB, 640x480)
116 KB
116 KB MP4
Perhaps xi was already kind of bogged?
>>
>>103525988
It's automatic. Imagine writing 10–20 emails a day that must be grammatically perfect. You get used to it.
>>
File: HunyuanVideo_00095.webm (1.26 MB, 1280x720)
1.26 MB
1.26 MB WEBM
>accusing a random person of starting infighting for no reason
a little ironic dont you think
>>
>>103526049
>Imagine writing 10–20 emails a day that must be grammatically perfect.
I work as an engineer and I let chatgpt do the faggot e-mail writing for me kek
>>
File: HunyuanVideo_00386.mp4 (293 KB, 640x480)
293 KB
293 KB MP4
>A man dances at Tiananmen square in front a tank.
>>
>>103526098
>those chinks haven't even censured tianamen square
how based are they seriously?
>>
>>103526110
It's a psyop to get our vram occupied with their video model to stunt any research in other areas.
>>
>>103526110
tiananmen square is a real place that has tank parades all the time anon, just because they ran over some protestors skulls with tanks one time 30 years ago doesn't mean it's illegal to suddenly generate a tank in tianmen square
>>
>>103526127
they could've trained the model to shit itself when you associate the tank with tianmen square
>>
>>103526135
>trained the model to shit itself when you associate the tank with tianmen square

That sounds like a very difficult thing to do.
>>
>>103526050
how do you get this shiny look
oily?
>>
File: HunyuanVideo_00387.mp4 (423 KB, 640x480)
423 KB
423 KB MP4
lol
>>
hey asshole >>103521168, that's my webbum
show yourself coward
>>
>>103526160
wow file a dmca
>>
>>103526146
something about "oiled glistening skin" will get you there
for that one it was "her shiny skin is oiled and glistening"
>>
>>103526160
you can't act like that, to make an AI model you must train it with millions of images/videos of people and not a single time you asked for your permission, that's hypocritical
>>
>>103526061
I hope you're not sending those emails to other engineers because it's dead obvious and everyone will hate you for wasting their time.
Just send the prompt as the email. The only thing chatgpt does is inflate the word count.
>>
>>103526186
*for their permission
>>
File: HunyuanVideo_00388.mp4 (321 KB, 640x480)
321 KB
321 KB MP4
>>
>>103526186
the model is under a permissive license, my image isn't
see you in court
>>
>>103526209
Anon is so fucked. He's really kicked the hornet's nest.
>>
File: 1728990837892550.png (600 KB, 1024x1024)
600 KB
600 KB PNG
paper mario noobAI lora, neat how it works with various prompts:
>>
File: 1706572513797515.png (580 KB, 1024x1024)
580 KB
580 KB PNG
>>103526225
>>
File: 1716158680598045.png (687 KB, 1024x1024)
687 KB
687 KB PNG
>>103526236
>>
File: file.png (117 KB, 1011x450)
117 KB
117 KB PNG
>>103526140
I'm sure it's possible.
>>
File: HunyuanVideo_00389.mp4 (435 KB, 640x480)
435 KB
435 KB MP4
>>
File: 1709970961156019.png (308 KB, 640x660)
308 KB
308 KB PNG
>>103526209
>the model is under a permissive license, my image isn't
and the images used during the training, were they under a permissive license?
>>
>>103526270
objection, irrelevance
If tencent stole media without informing their customers then that would make their customers victims of fraud
>>
>>103526295
>that would make their customers victims of fraud
that mean we could also sue tencent? kek, gimme that moneyyy
>>
>>103526295
>without informing their customers
*laughs in eula*
>>
I noticed that the lower the resolution we use on hunyuan, the more zoomed in the outputs gets, as if when they trained the model with lower resolution, they simply cropped HD videos to fit into smaller resolutions, I hope it's not the case that would be retarded
>>
>>103526326
from what i know about ML there is a moderate to high chance this is the case
>>
File: HunyuanVideo_00390.mp4 (339 KB, 640x480)
339 KB
339 KB MP4
>>
>>103526171
thanks anon
>>
>>103526347
arachnophobia jump scare warning
>>
>>103526347
>dark souls 1.mp4
>>
flipping through the license and apparently you're not allowed to use it if you're from the UK, EU or south korea
also this travesty of English
>You will defend, indemnify and hold harmless Us from and against any claim by any Third Party arising out of or related to Your or the Third Party’s use or distribution of the Tencent Hunyuan Works.
which I think means you'll keep your mouth shut about copyright
>>
>>103526401
>you're not allowed to use it if you're from the UK, EU or south korea
kek
>>
>>103526401
>>
File: 1720942054247323.png (742 KB, 800x450)
742 KB
742 KB PNG
>>103526401
>flipping through the license
>>
>>103526401
>you're not allowed to use it if you're from the UK
OI! YOU GOT A LOICENCE FOR THAT M8?
>>
>>103526401
why do you subject yourself to this meaningless unenforceable stuff
>>
File: 1704631646061058.png (837 KB, 1024x1024)
837 KB
837 KB PNG
nice view!

but seriously, this model does colors and lineart a lot better than pony imo, and backgrounds. better prompt understanding and a huge character set based on booru tags, so not as much reliance on loras. but you can still use them for styles and characters.
>>
For hunyuang lora trainers are you guys captioning your datasets at all? I'm gonna train a lora tonight and have a quite large image set that's already captioned for an anime model (illustrious). Should I run them all through a captioner to convert them to natural language? Should I just leave them uncaptioned?

If I get no response I guess I'll start with converting to natural language.
>>
>>103523727
Interesting, I would like to see an example that doesn't feel like slow motion, I wonder if that's how it worked, a video that could fit in 5 seconds extended with slower pace.
>>
hunyuan I sometime get slow motion or super fast and I still don't get why
>>
File: 1706498521438228.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>103526505
it's just so vibrant compared to pony stuff (in general), I thought it was a vae issue but nope.
>>
>>103523820
This image was weird, the thumbnail made me think it was Taylor Swift, but then I clicked on it and it was someone else.
>>
>>103526525
I think it depends on the numbers of frames you put
>>
>>103526513
I can't definitely say whether or not captions hurt of help I've tried with and without and got more or less what I wanted. Just like flux.
My anime experiments have no turned out well though and I don't know if that's just bad data or the model just doesn't play well with anime.
>>
>>103526535
I always use 97
>>
File: 1726934953872514.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>103526526
but the best part, it can do actual backgrounds, which pony can't really do.
>>
>>103526547
the number of frames recommanded by tencent is 129, that's the number the model is the most used to
>>
>>103526533
you just got A.I psyopped
>>
>>103526543
Ok, thanks! I'll try converting it to natural language, one issue is there's several characters in my dataset so I assume I need to give it something to identify them with. Hope it's not a disaster.
>>
File: 1718123313978728.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
2b \(nier:automata\) with "casual clothes" in the forest, it even got the mole right, which pony loras would fuck up (and require inpainting).

very good model.
>>
>>103526587
>there's several characters in my dataset
That always turned into a problem for flux, prays it works out for you but I wouldn't hold my breath.
>>
>>103526599
We all know.
>>
File: 1721588614081823.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
lmao

2b \(nier:automata\), mexican sombrero, casual clothes, masterpiece, best quality, newest, absurdres, highres, very awa, smile, mexico border

just wanted to test an out there prompt. SOUL
>>
File: 1707349776779665.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
wow, I never got an alice this good even with a pony nikke lora. genuinely impressed at the base model.
>>
>1girls posted once more on /ldg/
are we finally back?
>>
>1girls spam are back
It's over...
>>
So either way we're at slops-per-second.
Sovl when?
>>
>>103526734
there is video too which is good, im just testing noobAI. it's legit a step above the other stuff and a new model was released recently. (vpred 09)

we're getting to new levels both in static images and video generation.
>>
>>103526756
>it's legit a step above the other stuff
true, I've tested pony and noob finetunes back and forth, and illustrious/noob really has more going for it
>>
File: 1715029088200834.png (234 KB, 1141x1608)
234 KB
234 KB PNG
>>103526756
>it's legit a step above the other stuff and a new model was released recently. (vpred 09)
some people feels it's a downgrade
>>
File: 1716607303928016.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>103526766
even if you ignore everything else, the fact that it can do backgrounds is a step above what pony finetunes could do. and I like the autismmix model, but the main gripe is you make nice model, meh background/scenery. so you'd have to chop the model out and gen a good background with something else.

this for example is just a simple prompt with beach and jeanne d'arc alter \(fate\).
>>
>>103526782
some say the epsilon model is more lora friendly but ive had no issues, just have 2 models or whatever you prefer, ive had some good gens with the latest one and ill prob try others.
>>
>>103526782
People like this are just retarded consumers. Ignore and keep building.
>>
>>103526782
My guess would be it's even more annoying to tardwrangle in it's vanilla form. Just needs some finetune love I bet.
>>
>Windows: https://rentry.org/crhcqq54
shit guide doesn't work with python 3.12
>>
>>103526886
what error you got?
>>
File: HunyuanVideo-_00042.webm (857 KB, 960x544)
857 KB
857 KB WEBM
>>
>>103526886
Yes it does. Where are you tripping up?
>>
>>103526917
Do this, but pan it up to reveal a bog face. Or better yet furk.
>>
>>103526928
>Do this, but pan it up to reveal a bog face. Or better yet furk.
you want that reaction kek
https://www.youtube.com/watch?v=34gmbdmn3Gc
>>
>>103526899
AttributeError: module 'pkgutil' has no attribute 'ImpImporter'. Did you mean: 'zipimporter'?

my machines dependencies could just be irreversibly fucked, i'll restart everything from an anaconda environment
>>
>>103526959
I'm so glad I venved the whole thing
>>
File: 1731179055135328.png (259 KB, 1576x264)
259 KB
259 KB PNG
>>103526959
oh damn, this is what Claude told me
>>
File: 1704824788242815.png (187 KB, 1690x1627)
187 KB
187 KB PNG
>>103526959
>>103526993
you get more infos here
https://stackoverflow.com/questions/77364550/attributeerror-module-pkgutil-has-no-attribute-impimporter-did-you-mean
>>
Any word from furk on a LoRA training post?
>>
>>103526993
>>103527001
I think the best solution is to just create an environment with python 3.11 and roll from there
>>
>>103526993
I had zero problem installing sageattention and I'm using python 3.12.3
>>
>>103527009
What I want to know is what the fuck does he mean when he constantly spams "fully Fine Tune / DreamBooth" is it one or the other?
>>
>>103527063
>3.12.3
dunno what version ComfyUi has, I'm still on 3.11.9, I thought the 3.12 version of ComfyUi was 3.12.7
>>
bakerman
>>
>>103527064
I doubt he even knows.
>>
>>103527134
top kek
>>
>>103526928
This is why I always need to see the face, like if you give us a feet video I need to see who it belongs to!
>>
>>103527064
Schrodinger's LORA
>>
someone post a list of all dependencies and version numbers for a working hunyan please
>>
>>103527423
>>103527423
>>103527423
>>
>>103526600
Hmm I had mild success with it on flux. With SDXL it was horrible, basically found it impossible to do multi character loras in SDXL, had to rely on finetunes instead.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.