[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: the tallest dick general.mp4 (3.36 MB, 948x1688)
3.36 MB
3.36 MB MP4
Discussion of Free and Open-Source Diffusion models.

Videogen is a Meme Edition

Last bread : >>103463355

>Local (Hunyuan) Video
Windows: https://rentry.org/crhcqq54

>UI
Metastable: https://metastable.studio
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI

>Models, LoRAs, & Upscalers
https://civitai.com
https://tensor.art/
https://openmodeldb.info

>Cooking
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
Forge Guide: https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1050
ComfyUI Guide: https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Guides & Tools
Share the Sauce: https://catbox.moe
Perishable Sauce: https://litterbox.catbox.moe/
Generate Prompt from Image: https://huggingface.co/spaces/fancyfeast/joy-caption-alpha-two
Artifact resources: https://rentry.org/sdg-link
Samplers: https://stable-diffusion-art.com/samplers/
Open-Source Digital Art Software: https://krita.org/en/
Txt2Img Plugin: https://kritaaidiffusion.com/
Collagebaker: https://www.befunky.com/create/collage/
Video Collagebaker: https://kdenlive.org/en/

>Neighbo(u)rs
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai

>Texting Neighbo(u)r
>>>/g/lmg
>>
File: deBO_00029_.png (1.23 MB, 1728x1344)
1.23 MB
1.23 MB PNG
>mfw
>>
Is it just me or is Hunvid getting worse with each pull of the repo?
>>
>>103468057
I seem to be able to cram more frames into my vram after every pull. It must be coming at the cost of something
>>
>he pulls
>>
File: bogsylvania.mp4 (339 KB, 720x480)
339 KB
339 KB MP4
>>103468057
>>103468063
>he pulled
>>
>>103468079
the fuck? I swear I genned something like this months ago but it didn't move.
>>
Death to videofags
>>
assuming you're making weeaboo slop and aren't a vramlet, when do you use noobai over flux? is noobai just for へんたい?
>>
when did it start decoding rows
>>
File: ComfyUI_03131_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>103468101
Here, I found it lol. I knew I saw it somewhere before.
>>
>>103468101
are you the guy that genned vampire bog? please catbox your original gen man i love it so much >>103468113

i just took the pic and threw it into cogvideox, prompted lightning in the background and thats it.
>>
>>103468117
Here you are.
https://files.catbox.moe/f0z3t9.png

But you won't get too far without the LoRA I'll link that below.
https://gofile.io/d/qLKkfw
>>
>>103468142
thanks King
>>
File: out.webm (2 KB, 256x256)
2 KB
2 KB WEBM
>he pulled
>>
Should I pull? Yes or no
>>
>>103468158
np, if I recall correctly, the trigger world was Igor Bogdanoff, but I don't think it changed the outputs too much.
>>
>>103468176
Works fine for me but I can't guarantee it will for you. I guess if you're not a king of vramlets (24) it might be an issue.
>>
t2v is kill, black videos
what setting did the new nodes add that broke it
>>
did the Hunyuan devs announce any ETA on the image to video weights?

And btw, it would be nice if they could release a new pure T2I model as well. I like how even some stills from hunvid doesn't look like aislop. I would easily replace Flux with that
>>
apparently this furshit video is hunyuan
>>103468122
>the catbox wolf girl not pooh
>>
https://github.com/kijai/ComfyUI-HunyuanVideoWrapper/pull/72

Take a look at the hopium in this PR.

>As easy to train as Flux
>Generalizes from images
>Inference code works out of the box

This is almost as good as img2vid.
>>
>linking lmg reposts of vids from ldg
>>
>>103468193
Oh, I'm not using ChangVid
>>
>>103468234
@ me next time with correction, you spineless faggotron
>>
Anyone else think sora is kind of embarrassing? Like all that hype and it's just "okay". And they're charging hundreds a month for access to it. Imagine being scammed like that
>>
>>103468287
is there any use for it outside novelty?
if you take footage and put it in a commercial, is that within the TOS?
>>
>>103468108
Yeah
>>
>>103468108
Noobai for whatever weird niche porn you're into, flux for useable images that don't require close up inspection.
>>
>>103468287
The hype would have been justified if they didn't fucking wait 10 months since the announcement to release it allowing everyone to catch up
>>
>>103468385
Imagine hiding generating human beings behind a paywall. It's a funny joke.
>>
>>103468390
I remember them disallowing generating humans in the early days of dalle2 for "safety" reasons. All you could make with it was boring paintings, landscapes and animals.

Maybe in their minds less people generating humans allow them to have more control over the "safety" of the outputs and is also an excuse they can use to jew people out into paying the pro subscription.
>>
>>103468230
>>103468230
>>103468230

Is this fake news or not? If not this is pretty interesting.
>>
>>
Bets on first anon vid lora? I'm thinking some streamer whore
>>
>>103468487
Where is the training code? If some literal who can figure it out it can't be that different from the training code we have for flux
>>
>>103468487
I am not even a weeb, but I hope someone does a good finetune to fix anime motion, as the base model only outputs garbage in that style.
But I am not sure a Lora alone can fix that. Maybe that would require a full finetune
>>
>>103468487
Mayli
>>
>>103468560
LoRA can only generalized from images put into it. Idk what actual training on motion would involve but I imagine it would be really really gpu intensive.
>>
>>
File: out.webm (151 KB, 480x640)
151 KB
151 KB WEBM
>>
File: out.webm (157 KB, 480x640)
157 KB
157 KB WEBM
>>
Kijai's ego will not allow LoRA support to be merged into the main branch.
>>
GGUF ME UP ALREADY I WANT QUANTS!
>>
>>103468768
meh. There is a reason we have 5 different usable UIs that function within 99% of each other. Hopefully he won't be salty about it.... well at least not salty as others have been.
>>
>>103468784
He seems incredulous that it actually works.
>>
>>103468786
this shit is magic. I get minor dopamine hits when I resolve pip dependency issues.
>>
File: out.webm (107 KB, 480x640)
107 KB
107 KB WEBM
>>
File: HunyuanVideo_00050.mp4 (1.7 MB, 544x960)
1.7 MB
1.7 MB MP4
>>
>>103468004
>>
comfy killer
>>
File: 162a.jpg (90 KB, 912x1200)
90 KB
90 KB JPG
>>
>>103469183
catbox?
>>
>>103469208
just control_net for pose.
hair is inpainted
>>
File: Untitled.png (27 KB, 1084x336)
27 KB
27 KB PNG
Happening alert, Kijai admits LoRA weights are applied to model in current PR. Pending further testing.
LoRA support and training code soon?
>>
>>103469272
I don't wanna go down another lora rabbit hole, I hope my specs aren't good enough so I can just let this one slide
>>
>>103468745
>>103468798
clown girl bros.. we are so back..
>>
File: Untitled.png (21 KB, 1084x336)
21 KB
21 KB PNG
>>103469302
Sorry anon, it looks reasonable.
>>
>>103469362
If you're going to disable font antialiasing at least pick a font that looks good aliased.
>>
>>103469399
I keep it off to annoy people at this point.
>>
File: HunyuanVideo_00243.mp4 (598 KB, 960x544)
598 KB
598 KB MP4
>>
File: grift.plus.jpg (300 KB, 1729x964)
300 KB
300 KB JPG
Oh nice they trained on public domain only, they must care about open source and freed-
>>
>>103469437
The fuck is this?
>>
>>103469448
The latest grift
>>
anon testing sana: are you using the multiling or the base one in that topless gen?
>>
>>103469437
So they are not releasing weights?
>>
>>103469469
Training a saas-only model on public domain is worse than copyright infringement desu
>>
>>103469502
It's like guaranteeing your model will be shit and you're charging for it.
>>
>>103469437
is this image aggregator or what
>>
File: HyunVideo_00032.png (1.62 MB, 1120x1440)
1.62 MB
1.62 MB PNG
I gotta give this model credit, even flux has no idea what hair curlers and cigarettes are
now I have my trailer trash waifu
>>
>>103469655
Why do static Hyun images look so waxy and untextured? I've seen plenty of videos that look good, and individual images from those videos that look good, but single outputs always look like garbage.
>>
File: 1733772978202988.mp4 (351 KB, 960x544)
351 KB
351 KB MP4
>>103469686
It's the encoder.
>>
>>103469530
Artists won't like it any better, they don't really care if the author of the training material has been dead for 75+ years, they just don't want AI at all
>>103469618
Model trained exclusively on public domain artwork that they're going to charge $10/month for
>>
File: 1720807640035511.mp4 (627 KB, 1024x512)
627 KB
627 KB MP4
>trellis needs 16 gb vram
Oh for fuck's sake
So close, yet so far away
>>
>>103469718
what good is it without rigging
>>
>>103468004
You're telling me bottom left is generated with ai?
My friend wants to know how would one go about doing that
>>
>>103469723
Rigging isn't 1/10th of the issues with the models this puts out. 3D modeling is multitudes more complex than 2D art. Not to say it's harder for a human to do, it's fairly straight forward, but the level of understanding and reasoning that go into making a competent 3D model go beyond what any current AI model can reasonably do. I guess they can do rocks or inanimate objects well.
>>
>>103469723
arent there auto riggers?
>>
>>103469739
NTA but most 3D software has auto rigging.
>>
>>103469465
Anon with the booba gen here. I'm using this one:
https://huggingface.co/Efficient-Large-Model/Sana_1600M_1024px/tree/main

I don't know what the multiling you're referring to is, so I'm probably not using it
>>
>>103469734
that was a lot of words to say that trellis makes bad models
>>
>>103469741
0 experience with rigging. Can't you just slap those 3d models in there and use that?
>>
>>103469743
it's in the collection, basically a 1.1 version that supports chinese and emojis, but apparently it's just better
>>
>>103469768
You can rig anything with vertices. But you need very specific topology for it to deform in a way that doesn't look awful.
>>
>>103469769
Ahh many thanks, sleepy and about to go to bed but I'll give it a whirl tomorrow
>>
File: HunyuanVideo_00245.mp4 (450 KB, 960x544)
450 KB
450 KB MP4
>>
>>103469272
>4gb more VRAM
that's a lot... you already don't have much vram room to spare to make the model run a decent resolutions
>>
>>103469686
>Why do static Hyun images look so waxy and untextured?
yeah I noticed that, on the paper they said that they trained on images first then went onto videos, my guess is that they used AI slop images, but because there's little AI slop videos on the internet they had no choice but to go for real videos, thank god it happened that way
>>
>>103468230
to be fair I don't see why loras training won't work well on Hunyuan, it's a good base model and yeah it's distilled so what? Flux is also distilled and the loras work flawlessly with it too
>>
I'mma get a 5090 and put all you putz's to shame
>>
How do I fix the t2v black screen after updating?
>>
>>103469994
I... don't remember...
I had it too but it fixed itself, sorry
check your model loader wasn't changed to fp16
>>
File: HunyuanVideo_00246.mp4 (365 KB, 960x544)
365 KB
365 KB MP4
>>103469978
4gb then goes back down to normal. Likely a memory management oversite.
>>103469985
There is insane amounts of slop in the dataset, but it's interesting to see how it makes the slop move.
>>103469987
flux falls apart with some LoRAs
>>
>>103469997
Oh wise man, how do we train models with 0 sloppa input
>>
File: 1704838752175738.png (109 KB, 224x225)
109 KB
109 KB PNG
>>103470002
you take a look at your tens of millions of input and you remove the AI slop one by one, sounds easy enough
>>
>>103470004
aren't there supposed to be tools purpose made for detecting AI slop? can't they filter it out
>>
>>103470004
I mean just hire pajeets to manually go through images and discard the slop, nothing beats a human eye
>>
>>103470007
there are tools, and I don't know if they're efficient enough though, it'll never be better than a human doing it though, same thing for captioning
>>
File: HunyuanVideo_00118.webm (466 KB, 544x960)
466 KB
466 KB WEBM
you can't defeat AI sloppa, especially once it becomes flux r/roastme tier
a smart enough model should understand the difference between the real world and the slop world though
>>
https://youtu.be/ADj5OjDJhL4?t=377
>Imagine paying 200 dollars per month to use an i2v model that can't do i2v proprely
>>
>>103470007
I mean, you could filter out everything made on 2022 or later if you want to be overkill.
>>
Anyone else just getting a black screen when generating hunyuan videos? It might be that I can't get SageAttn2 installed, but I want some feedback
>>
>>103470050
>>
>>103470050
>you could filter out everything made on 2022
unironically that's a great idea, there's a shit ton of images pre-2022 already
>>
>>103470050
is sourcing the creation date any easier?
>>
File: HunyuanVideo_00119.webm (757 KB, 544x960)
757 KB
757 KB WEBM
>>103470054
talking into the microphone is very clever, well done anon from last thread
>I can't get SageAttn2 installed
why

>>103470055
thats not slop thats deep dream and slop outnumbers deep dream stuff 100 to one
>>
>>103469686
I'm not sure how much it contributed, but most images of women are shopped or have beauty filters on, which would create a bias against fine textures on faces in the training data.
>>
File: 1705120580499471.png (1.39 MB, 1547x1594)
1.39 MB
1.39 MB PNG
https://github.com/comfyanonymous/ComfyUI/pull/5975
>Add MaHiRo (improved/alternate CFG)
Interesting
>>
>>103469996
Thanks man, I just reset all settings and it's working again. I would have given up if you didn't tell me it was possible to fix.
>>
What are the optimum settings for 3090? Like what is the absolute limit in terms of resolution/framerate/video duration etc?
>>
>>103470071
Can you post prompt or catbox?
>>
File: HunyuanVideo_00248.mp4 (470 KB, 960x544)
470 KB
470 KB MP4
>>
>>103470110
what happens when you ..\python_embeded\python.exe -s -m pip install distutils and try the sageattention install stuff again

>>103470124
literally the microphone prompt last thread (ctrl F for ASMR) but replaced "elf" with "teen", 50 steps
>>
>>103470071
Jesus
>>
File: out.webm (182 KB, 320x480)
182 KB
182 KB WEBM
>>
File: HunyuanVideo_00124.webm (926 KB, 960x544)
926 KB
926 KB WEBM
lol
>>
File: out.webm (54 KB, 320x480)
54 KB
54 KB WEBM
>>
File: HunyuanVideo_00125.webm (741 KB, 544x960)
741 KB
741 KB WEBM
braces dont look the best
>>
File: 1716061141294237.mp4 (184 KB, 544x320)
184 KB
184 KB MP4
Has anyone tried the different finetunes of clip_l on Hunyuan? like going for SAE or Smooth instead of the base model? I notice some difference but I can't tell which one is the best
>>
Why the fuck do my gens look blurry as fuck compared to the ones posted ITT?
>>
>>103470285
show a screen of your workflow anon
>>
>>103470293
Literally copy pasted from the example on Kijai github page.
>>
File: 1633781189100.png (2 KB, 158x160)
2 KB
2 KB PNG
>mfw
>trying to make gens on a 2080
>>
https://files.catbox.moe/7l2kyv.mp4
>>
>>103470303
he's using low resolutions, go higher if you want something more crisp
>>
>>103470313
It's just resolution? Not iterations or other settings/upscalers that bump up the Crispness?
>>
File: 1729021563877842.png (66 KB, 620x453)
66 KB
66 KB PNG
>>103470306
Never been happier than with a 3090
>>
>>103470318
not really, there's not much to change on hunyuan desu, the steps, the resolutions, the number of frames, that's it
>>
>tfw couldn't get flux to do what I wanted
it's owari da
>>
>>103470319
I tried to buy a 4090 even though the 5k series is next month. But even now I fucking can't find one, except for scalping cunts going for 3.5k
>>
>>103470349
wait for the 5090, would be a mistake to buy a 24gb card now when a 32gb one will be released in a month
>>
Hey how good is image generation nowadays? I'm right now genning HunYuan videos like everyone else ITT but it makes me wonder how good image generation has become. Last I did anything with it was 2 years ago and looking at threads about image generation it's filled to the brim with so much new vocabulary/terminology that it's completely impenetrable if you have a job and can't spend an entire weekend figuring everything out.
>>
>>103470349
>check local retailer
>4090 for 2.7k euros
What the fuck? I bought mine for about 2k on the day of release. What happened? Mining hasn't been profitable on gpus since eth moved to pos.
>>
File: 1702678978710140.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>>103470351
>Hey how good is image generation nowadays?
great, we have Flux dev, finally a model that's as good as the API ones in terms of making good anatomy humans
>>
>>103470362
>What happened?
>Sitting in a thread where people squeeze the absolute maximum of the scraps of vram they have while companies spend billions on GPUs and vram to train the models
>HUUR GUYS WHY IS VRAM SO EXPENSIVE NOW?!

Are you actually retarded?
>>
>>103470362
>What happened?
local models got better and everyone want to buy a 4090 to enjoy flux and hunyuan now kek
>>
>>103470374
>>103470377
That's still very niche. Normies aren't buying them for that.
>>
>>103470384
So you think even a few thousand hobbyist buying up every loose GPU won't jack up the prices on the used market?
>>
File: HunyuanVideo_00128.webm (697 KB, 544x960)
697 KB
697 KB WEBM
hopefully the release of the 5090 lowers the price of the 4090. the 3090 is already obsolete
>>
File: 1730415422359536.jpg (334 KB, 3691x807)
334 KB
334 KB JPG
>>103470384
>niche
Flux dev has been used more than SoulCalibur 2
>>
>>103470362
I don't know about your particular region, could be simply they are clearing stock and as there is limited stock they can dictate the price before the next series launches
>>
File: out.webm (83 KB, 320x480)
83 KB
83 KB WEBM
>>
>>103470415
>the 3090 is already obsolete
as much as I hate to admit it, I should've brought a 4090 instead of the 3090, it's obvious it's the only card that can handle those new giant models proprely
>>
can hunyuan do loop videos like sora?
>>
>>103470415
The 4090 will be obsolete when 5090 is released.
>>
File: 1716368684592420.png (112 KB, 1616x995)
112 KB
112 KB PNG
>>103470455
no it can't, we need the i2v model for it to work, it's on their checklist
>>
>>103470456
3090 holds up well, I think 4090 will too
>>
>>103470471
the 3090 is a joke compared to the 4090, if you want to go for 960x544x97f, it takes 20 mn for the 3090 and only 6 mn for the 4090 (because that one can support fp8 torch compile + fp8_fast)
>>
>>103470471
>>103470456
There is only one number that matters and that's the number of GB of vram at your disposal. Everything else is just window dressing.
>>
>>103470477
Yeah this has been my experience as well comparing my 4090 vs 3090
>>
I got my 3090 for $300 so I still think it was worth it over buying a 4090 for 3 or even 4x the price.

That said I will still upgrade to a 4090 if it ever reaches a similar pricepoint as when I bought the 3090.
>>
>>103470480
the speed is important too, I wouldn't want a 3060 with 48gb of vram, that would still be too slow
>>
File: HunyuanVideo_00129.webm (764 KB, 544x960)
764 KB
764 KB WEBM
>>103470456
it won't be obsolete because it can do video
its inferiority depends on how much tech the 5090 gets that the 4090 does not
>>
>>103470489
>I got my 3090 for $300
Did it fall off the back of a trailer
>>
Made a 544x960 with 30 steps and it crashed my 3090?

How do you guys manage to generate at that resolution?
>>
File: 1709934750125069.mp4 (1.3 MB, 960x544)
1.3 MB
1.3 MB MP4
>>103470497
the 5090 will be the most wanted card ever, because with 32gb you'll be able to finetune flux and hunyuan without having to go for some block swap meme and shit
>>
>>103470500
Nah it was "used for mining". It has 0 issues at all but people are retarded and think if it's used for mining it's unusable. Even though miners underclocked and undervolted the GPU to save on energy, so if anything it's less "used" than gaming GPUs.
>>
>>103470505
It just works? Idk why its not working for you. Are you using sage?
>>
File: 1727314329806160.png (172 KB, 1420x1074)
172 KB
172 KB PNG
>>103470505
>How do you guys manage to generate at that resolution?
look at your task manager, if it overflows your card then decrease the number of frames, I can barely make it for 97 frames
>>
>>103470505
you need sage, it's way more memory efficient than spda
>>
>>103470510
>Nah it was "used for mining". It has 0 issues at all
yeah, say what you want about Nvdia, but their cards are really resistant, they don't break that easily
>>
File: out.webm (229 KB, 480x640)
229 KB
229 KB WEBM
>>
>>103470517
>>103470523
I use sage

>>103470518
Frame was 85 and steps 30. It also showed it was already at 100% before it gave an error. My gut feeling says it has to do with actually encoding the video not generating it, but I don't know enough about the workflow to know why it would crash like that. Have no other issues at lower resolutions though.

It crashed at the decode step.
>>
File: 1711140670207439.png (54 KB, 881x501)
54 KB
54 KB PNG
>>103470552
>It crashed at the decode step.
did you update his node? you can make the decode step less memory intensive by reducing some of its values
>>
>>103470497
you can only get that in china but it's open source?
>>
I wonder if the quality is significantly worse at lower resolutions because they trained on very old video files/porn that were very low resolution and low quality. I also notice tiktok vertical videos are higher quality.
>>
>>103470605
>I wonder if the quality is significantly worse at lower resolutions because they trained on very old video files/porn that were very low resolution and low quality.
every model perform worse at low res, try to go for 512x512 on Flux, it looks fine, but it's not even close to the quality of 1024x1024, you're not helping the model by giving it less pixels to work with, hell I wouldn't draw well if I was giving a tiny square instead of a full A4 paper
>>
https://files.catbox.moe/c8rt8t.mp4

Is this really the best the model can do? Why is the motion so blurry compared to other videos ITT?
>>
>>103470623
Framerate is fucked
>>
>>103470632
Can you give me some good settings maybe?
>>
File: 00047-503345820.png (2.02 MB, 1200x1208)
2.02 MB
2.02 MB PNG
what's the status on video generation? it always seems to be some chinese website rather than something i can run myself
>>103470505
no issues with automatic1111 for over a year now
>>
File: 1707050709047423.png (39 KB, 1371x359)
39 KB
39 KB PNG
>>103470623
you went for 16fps, the model is supposed to work at 24
>>
Will I even be able to buy a fucking 5090 on launch? Everyone wants that card, so there will be scalpers and bots buying EVERYTHING. How the fuck am I suppose to secure one
>>
>>103470651
I wish there was an "early access" for graphic cards like video games kek
>>
>>103470651
when I tried to buy a 4090 from nvidia it became unavailable as i entered my information. people turn it into a sport.
>>
>>103470651
I solve that issue by having a bot buy it for myself. I did the same with the steamdeck and the 3090 when it just launched. Believe it or not most people just quickly rush to a storefront website to order it within the first 30 minutes it launched. If you build a bot on something like amazon or the Nvidia site you will be basically guaranteed to be able to buy one.

Since this is /g/ it will take you about 1 hour of quick reading to set one up. It's very trivial and there are pre-built frameworks for this.
>>
File: 1721046486839764.mp4 (1.7 MB, 960x544)
1.7 MB
1.7 MB MP4
>>103470276
>like going for SAE or Smooth instead of the base model?
I went for clip_l SAE for that one, that's good
https://huggingface.co/zer0int/CLIP-SAE-ViT-L-14
>>
>>103469437
>>103469708
wow we have ethically sourced organic models now
>>
File: 1490991839858.jpg (26 KB, 480x451)
26 KB
26 KB JPG
People complaining about not being able to buy a 3090/4090/5090 at launch are wrong.

It's essentially a "technical literacy" test. If you can build and deploy a bot to get one of those cards at launch it means you probably will actually utilize the capabilities of the card so you deserve first pickings. If you are too lazy to deploy a bot to buy it for you then you probably didn't want the card that much anyway. Or you're not capable enough to use it to utilize it for rendering/AI/CUDA anyway and will just it to get slightly higher framerate at whatever bullshit videogame anyway.
>>
File: 200%.jpg (234 KB, 500x738)
234 KB
234 KB JPG
>>103470674
May have to do just that. I'm not sure I can enter my info on the store faster than the scalping bots. Could also try my luck by camping at a micro center.

Seriously that you have to camp outside of a fucking store in 2025 to get a GPU is pants on head fucking retarded. They need to produce enough of these fucking things to keep them in stock
>>
File: HunyuanVideo_00133.webm (1 MB, 960x544)
1 MB
1 MB WEBM
>feet transition

>>103470507
>the 5090 will be the most wanted card ever
i know i want one
>with 32gb you'll be able to finetune hunyuan
do we know how many videos we need to finetune video models yet?

>>103470601
>you can only get that in china but it's open source?
what? no follow the guide in the OP for local video
>>
>>103470703
some people also aren't scumbags who ruin nice things for everyone else
>>
>>103470706
>Seriously that you have to camp outside of a fucking store in 2025 to get a GPU is pants on head fucking retarded.
it's the ps2 launch all over again, you know a product is good when everyone fight to the death to get it on day 1
https://www.youtube.com/watch?v=zBDah-cNFXE
>>
>>103470720
I am genuinely pissed that this shit still happens. There shouldn't be a need to run a bot to secure a god damn GPU
>>
>>103470350
the used market of 4090/3090 will be quite nice, if only you could used 2-3 to have effectively 2-3x24GB in generation
>>
>>103470729
What happened to SSLI?
>>
>>103470727
>I am genuinely pissed that this shit still happens.
that's the point, they make less cards than the demand so that it's rarer and more expensive, they make more money by sellings less cards but more expensive cards, welcome to capitalism 101
>>
>>103470362
nvidia stopped producing the chips, so stock is dwindling while the demand is still there
wait for the 5090 or buy a used 3090
>>
>>103470706
>>103470716
Yeah fuck the scalpers, I hate them too. But building a bot to buy one isn't that hard it's not like you actually need to code anything. If you can setup HunYuan you can also setup a bot to buy it for you at launch.

Make sure to cache all store pages on your browser as well so that when the server is overloaded your bot can just easily bypass as much as possible.

To give you some indication I'm usually within the first 500 people in the world to buy a 90 series GPU because of my bots. And I just use a framework you can find online.

I don't want to spell out how to do it exactly because if it just becomes a script for people to doubleclick on it will be so trivial that even stupid gamers will start doing it instead of people like us that actually need it for production work.
>>
>>103470743
I'm a massive retard and also lazy. Whats the framework? I want to build it now so it can run when the thing releases. I will have that god damn GPU
>>
>>103470638
The man is clearly referring to video generation. But you're a literal caveman using a1111 still.

What the fuck. How can you still be using that broken ass front end?
>>
>>103470703
I'm a dev but I'm not going to exert a disproportional amount of effort to BUY A PRODUCT, that in my view is peak consumerism retardation.
I'm not going to pay some scalper either, I'll just wait until I can get one in the normal way.
I've been fine without <new thing> for months, I'll be fine for a bit longer.
>>
>>103470743
The fact that I don't even know the price of the 5090 yet but will still buy it regardless of price pisses me off. I'm going to drop almost 2-3k on a single GPU? It sounds so pathetic
>>
>>103470788
>I'm going to drop almost 2-3k on a single GPU? It sounds so pathetic
as if we had the choice, they have the monopoly they can fuck us in the ass if they want
>>
>>103470796
>they have the monopoly they can fuck us in the ass if they want
How can they have a monopoly if monopolies are illegal?
>>
>>103470788
I used to think my friends who were into car tuning stuff spent way too much into their hobby.
Now I kind of understand.
>>
File: HunyuanVideo_00008.mp4 (1.66 MB, 544x960)
1.66 MB
1.66 MB MP4
Ukraine if it JRPG
>>
File: HunyuanVideo_00136.webm (701 KB, 960x544)
701 KB
701 KB WEBM
ass transition is better
>>
File: 1728200279156451.png (572 KB, 1080x1004)
572 KB
572 KB PNG
>>103470813
>monopolies are illegal?
they aren't technically in monopoly because AMD exist, but AMD exists just to give the illusion, nothing else
>>
>>103470787
I'd be fine if I had a decent card. But I'm rather poor so I've only got a 2080. If I want to continue with flux / SD / Hu then I'll need the equipment. I couldn't even get a 4090 and the 3090 will be worthless soon enough.

>>103470796
I don't know how they're allowed to have a monopoly on the cards. US even knows NVIDIA is the only option for AI so they're trying to ban it for China. AMD has flat out said they want nothing to do with cutting edge models and Intel has no hope of competing. So what fucking choice do we have? Fuck it, Daddy USA needs to take over NVIDIA at this point
>>
>>103470727
The only way to avoid it is to inundate the market with so many products the scalpers lose money.
That's what happened with the PS5 pro, plenty scalpers were forced to sell at a loss on ebay because sony made a lot of them.
>>
>>103470832
you know damn well nvidia won't mass produce it. They'll do custom orders to the big AI companies and let us have the few cards remain. It wouldn't be so bad if gaymers didn't try and get them
>>
>>103470738
If you mean nvlink, only the 3090 has it, and it doesn't matter, the system still sees two cards, and the only way to see one would be for someone to actually develop a way for that.
>>
>>103470830
>US even knows NVIDIA is the only option for AI so they're trying to ban it for China.
it'll be a good move in the long run, China will do like AI, they'll make great GPU cards by themselves and they'll be less expensive
>>
>>103470826
how do you get these random ass transitions?
>>
>>103470856
>china will make cards
bro what? China doesn't have the capacity for that shit. They basically just steal tech and try and imitate it. Theres not a chance they can replicate a 5090 equivalent. Also are there even card manufacters in china...?
>>
>>103470872
>They basically just steal tech and try and imitate it.
not anymore, they are improving AI without the help of the west now, look at MiniMax, look at Hunyuan, look at QwQ
>>
>>103470826
Kino
>>
what's the difference between software and hardware ?
>>
>>103470796
I mean, can't we just get a 4090 used when people sell them to upgrade to the 5090? Probably get the thing for near MSRP and it'd be way less of a fucking headache. Then get the 5090 in like a year when shit calms down
>>
File: 1731754714782095.png (226 KB, 1815x1312)
226 KB
226 KB PNG
that's why BitNet must be a thing, with BitNet you don't need MatMul operations, meaning that you don't need complex GPU anymore, that shit would kill Nvdia's monopoly
https://arxiv.org/abs/2402.17764
>>
China can't produce on a node smaller than 7nm and all their 7nm nodes are already booked for the next 5 years for Huawei networking equipment.

China has been stuck on 7nm for 6 years already as they don't have EUV machines. China is losing relevance in the chip industry as the gap between them and the west is growing every year due to them stagnating at 7nm. They are marketing a 6nm and 5.5nm node this year but they are just rebranded 7nm nodes.

China will not be building top of the line GPUs very soon. They might make monstrous GPUs that eat 3000W and consist of 5 interconnected chips that compete with a 5090 and maybe to train models, but in terms of efficiency or production cost they won't be competitive at all.
>>
>>103470903
>software
the code (python, C++...)
>hardware
the CPU, the GPU...
>>
>>103470813
Laws are not just about the text that is written down but also how the text is interpreted and enforced.
For monopolies in particular you need federal agencies to take action or nothing is going to happen.
But since Trump's administration in his first term was firmly on the side of billionaires and corporations I don't think anything is going to happen in the foreseeable future (unless there is more lobbying in the opposite direction).
>>
>>103470903
Video is still generating: software
Video is done: hardware
>>
File: 1725366252390107.png (2.15 MB, 1202x1112)
2.15 MB
2.15 MB PNG
>>103470933
>But since Trump's administration in his first term was firmly on the side of billionaires and corporations I don't think anything is going to happen in the foreseeable future
Trump likes AI though, that means that the companies will be less scared to go for based models and we'll get less censored shit
>>
>>103470923
>>103470943
Thank you
>>
>>103470947
Yea but he's not going to do anything about the rampant scalping and price gouging. He lives for that shit. He'll let NVIDIA run wild. Hell I expect the card to go for 2-2.5k at this point
>>
>>103470973
I mean, who has even put a stop at Nvdia? They are dominating the market since the early 2000's
>>
File: HunyuanVideo_00137.webm (1.15 MB, 960x544)
1.15 MB
1.15 MB WEBM
rare thigh transition

>>103470857
i just prompt for "the video cutting to their bouncing butts and bare feet" so sometimes i get feet (works bad, took it out) or butts (works ok i guess)
>>
>>103470982
I'm just pissed that I'm going to have a hard time securing a 5090. I'm not a poorfag so I'll drop the money, but it shouldn't be so hard to get the hardware
>>
>>103470985
>the video cutting to their bouncing butts and bare feet
oh I see
>>
File: 1718666021260692.png (221 KB, 2068x957)
221 KB
221 KB PNG
>>103470985
can you try for that setting and see if it listens to your prompts better >>103470688
>>
How do you feel that normalfags can use Stable diffusion and Hunyuan? Even my giga-normie brother is using SD with Comfy
>>
Why are people using fp16 instead of bf16?
>>
>>103471009
the text encoder is fp16, the rest is bf16 on Hunyuan
>>
>>103471004
Based, the more normalfags are like us, the better our world will be
>>
>>103471025
the more people are using it, the more outrage there will be though, everything that turns into mainstream turns to shit
>>
>>103471004
Not based. Fags need to keep out of my hobby. If you can't use python or C you have no right generating AI. Get the fuck out
>>
>>103471034
This, you know normalfags getting in on AI will cause the government to regulate the shit out of it faster
>>
>>103471009
Assuming all numerical values are in the range representable by FP16 there is literally no benefit to using BF16.
>>
>>103471047
>there is literally no benefit to using BF16
there's no drawback either right? they're both 16 bits so neither of them are using more vram, the thing is that not all gpus can support bf16, so if you can't run the model because of that then it's time to go for fp16
>>
>>103471034
Outrage depends on whether the elite decides to generate it through media, seems like AI isn't really a subject of media focus anymore. If new tech is discovered that gives us 100x improvement, I assume anti AI propaganda would ramp up massively. If it doesn't threaten their power, it's fine, especially if it can give them plausible deniability for recordings.
>>
Just use bf16 for everything people, 3090/4090 both support it and it's faster
>>
>>103471068
There are two modes for FP16/BF16 tensor cores on NVIDIA GPUs.
The inputs are always FP16/BF16 but the accumulator can be either FP16/BF16 or FP32.
If the accumulator is FP32 then BF16->FP16 conversion makes virtually no difference.
If the accumulator is 16 bit then you would in principle get a lower rounding error with FP16 though I highly doubt that it's going to make a significant difference for the inference of generative neural networks.
>>
>>103471085
>Just use bf16 for everything people
I wouldn't use bf16 for the fp16 text encoder though, the fp16 -> bf16 conversion decreases the quality a little
>>
File: 1733597139064486.mp4 (227 KB, 960x544)
227 KB
227 KB MP4
>>
File: 1716059813550551.png (227 KB, 1829x613)
227 KB
227 KB PNG
https://huggingface.co/tencent/HunyuanVideo/discussions/15
lmao
>>
File: HyVid_00006_.webm (2.14 MB, 960x544)
2.14 MB
2.14 MB WEBM
>>103471549
lol wut? the only thing I saw the eu do recently is make faulty soft/hardware liable for litigation but that doesn't apply to open source
>>
File: 1727207480237065.png (249 KB, 960x703)
249 KB
249 KB PNG
>>103471569
>the only thing I saw the eu do recently is make faulty soft/hardware liable for litigation
Sora can't be used in the eu aswell
>>
>>103471569
>companies start open sourcing their shit under the most bullshit licenses to get around litigation
based
>>
File: HyVid_00007_.webm (1.62 MB, 960x544)
1.62 MB
1.62 MB WEBM
>>103471578
kek

>>103471581
indeed. very based
>>
>fishing for attention
>>
File: 1708517237760272.gif (957 KB, 256x320)
957 KB
957 KB GIF
>>103471581
every new day, when I think the chinks can't be more based, they surpass my expectations even more
>>
File: HunyuanVideo_00012.mp4 (1.04 MB, 960x544)
1.04 MB
1.04 MB MP4
This is pretty epic ngl
>>
>>103471578
>>103471581
I dont get it, what's the point?
>>
>>103471600
kek
>>
>>103471602
>what's the point?
the eu hate AI, so they decided to move out of the race and become a set of third world countries, seems fitting because they also love to importe the third world country into their own
>>
>>103471602
Stopping companies from selling promises, then delivering pajeet tier software. It will probably have some weird down stream consequences, like >>103471581
>>
>>103471594
https://arxiv.org/abs/1706.03762
>>
File: HunyuanVideo_00150.webm (1.18 MB, 960x544)
1.18 MB
1.18 MB WEBM
https://files.catbox.moe/fmla01.webm
this is the worst its ever going to be
>>
>>103471756
The internet will overflow with cunny at this rate
>>
accelerate
>>
File: hunyuanlora.png (25 KB, 973x419)
25 KB
25 KB PNG
of course there is fucking snitches
>>
>>103471615
>>103471619
So which one is it?
>>
File: HunyuanVideo_00154.webm (1.31 MB, 960x544)
1.31 MB
1.31 MB WEBM
wtf was that the ghost of harambe

>>103471830
this will save the children, probably, maybe
>>
>>103471569
>>103471585
gm ani
>>
>>103471905
Both, regulations always fuck something up, that you can know for sure.
>>
File: HunyuanVideo_00157.webm (1.14 MB, 960x544)
1.14 MB
1.14 MB WEBM
>>
File: HunyuanVideo_00158.webm (1.07 MB, 960x544)
1.07 MB
1.07 MB WEBM
>>
>>103471004
Normies use a1111 aio install or online stuff.
Only people with minimal technical skills use comfy.
And it doesn't matter.
>>
We're getting so many Hunyuan tools, can't wait.
>>
>>103471578
it's just a question of time and probably gdpr stuff
>>
>>103471904
what snitch? it's nothing hidden or bad?
>>
>western devs
NOO THERE'S A 0.0001% CHANCE IT WILL GENERATE A LOLI WITH A 2000 WORD JAILBREK PROMPT, IT'S DANGEROUS!!
>chinese devs
我们的模型似乎偏向于生成萝莉,但性能很棒,让它成为锦上添花
>>
File: HunyuanVideo_00011.webm (3.34 MB, 1280x720)
3.34 MB
3.34 MB WEBM
>>103468004
>>
>>103472252
>pregnat whore dual wielding vapes at a soulless advertisement optimized diversity party
Esoteric prompt
>>
>>103472252
nice quality, how long did it take you to gen that video?
>>
>>103472250
lol did they write that
>>
>>103472262
The people are supposed to cheer in reaction to the blue smoke but we're not quite there yet.

>>103472268
43 minutes on an RTX 4090 (headless Linux server).
>>
File: HunyuanVideo_00084.mp4 (525 KB, 960x544)
525 KB
525 KB MP4
>>103472287
43 minutes huh, damn, nj tho
>>
File: HunyuanVideo_00162.webm (1.25 MB, 960x544)
1.25 MB
1.25 MB WEBM
>>
Prompt rewrite models will be huge going forward. With human preference scoring you could do RLHF for video outputs and fine tune an LLM. But why did they elect for an absolutely gigantic rewrite model for Hunyuan?
>>
File: HunyuanVideo_00087.mp4 (480 KB, 960x544)
480 KB
480 KB MP4
>>
>>103472284
No, but actions speak louder than words
>>103470021
>>
>>103472469
cope or not, I'm kinda glad I don't have 4090. I would end up breaking my dick
>>
Do you guys train models?
>>
>>103472350
>>103472469
I'll follow these asses to the end of the universe.
What prompt did you use, and how much garbage did you get vs nice output?
>>
>>103472875
I'm using the prompt template structure:


Describe the video by detailing the following aspects: 1. The main content and theme of the video.2. The color, shape, size, texture, quantity, text, and spatial relationships of the objects.3. Actions, events, behaviors temporal relationships, physical movement changes of the objects.4. background environment, light, style and atmosphere.5. camera angles, movements, and transitions used in the video


a 22 year old beautiful woman walking from behind, shes wearing a pink thong underwear, A low angle fish-eye lens 35mm shot that follows the woman buttocks closely
>>
>>103470850
>gamers
>5090
please stop being so dishonest
>>
>>103470872
>China doesn't have the capacity for that shit. They basically just steal tech and try and imitate
why do people keep saying that? all the best chips in the world are made by tsmc in taiwan. what do you think taiwan is? they're all chinese. nvidia ceo is chinese + half the people that work there, if the chinese government asks for the schematics, they will hand it over on a email.
>>
>>103469987
it's better than flux because it's pretty unfiltered so even if it was trained on the wrong captions it was trained on the right video clips
which means a Matrix lora would likely quickly turn it into a fake Matrix clip generator with the right actors
>>
>>103471904
>bro people are writing code for your open source model, creating what they call "Loras" as a new AI model maker I'm sure you've never heard of them
>>
>>103470920
i don't know if you ever looked at a map but taiwan is not the west. they're all chinese. if america keeps bothering them with taiwan, half the people that work for those chip companies will defect to mainland china.
>>
>>103472923
>I'm using the prompt template structure
what is this?
>>
>>103471046
crippling the west and ceding victory of the ai race to china who won't be regulating anything and as a fuck you they will release open source models to humiliate westoids.qq
>>
>>103473034
mainland Chinese culture is very different from taiwanese one
the PRC is hindering their own population
>>
>>103472923
thanks anon!
is it like flux where an input LLM for text is better than writing it ourselves?
>>
>>103473080
nevermind I'm retarded
>>
>>103472574
there are anons here that do foundational model training
>>
>>103473034
>if the chinese government asks for the schematics, they will hand it over on a email.
I think you need to study the region inner workings a little more.
>>
>>103473100
>mainland Chinese culture
Well to be honest this varies greatly inside mainland too. When it comes to business mainland and TW cooperate a lot.
>>
>>103473100
>mainland Chinese culture is very different from taiwanese one
maybe 30 years ago, not anymore. china is a very modern country, a lot of it rivals and even beats taiwan, in terms of business and innovation.
>the PRC is hindering their own population
how so?

>>103473138
i think you need to study the chinese as a race. you think they're going to screw over their own people over a quick buck and gay rights? they hate gays, even in taiwan. as much as the us controls their leadership and pushes for conflict, the taiwanese people won't go along with it. nothing more devastating for them than going against china. everyone there knows this.
>>
>>103473112
No, its more like the LLM is between your text prompt and the video, in this case the LLaVa LLaMA model, it follows the instruction that is set(>>103472923), in the wrapper you can disable tho, so maybe you can try your custom prompt you can also change the instruction, maybe we can even change the model, but i don't know if there is more vision models, I also dont know why its a vision model, I wonder if we could change it to a 3.1 model
>>
>>103473198
Why people love talking about the Chinese in absolutes, they are literally billions, do you think they all think alike? its literally impossible, anything thats supposed to be banned there, still is produced/generated
>>
>>103473201
OK makes sense.
Is there any ready made full comfy template with these people here use?
>>
>>103473239
>Why people love talking about the Chinese in absolutes,
because they are
>they are literally billions, do you think they all think alike?
yes
>its literally impossible,
it's not
>anything thats supposed to be banned there, still is produced/generated
because the government allows it (doesn't crack down on it) and chinese are greedy
>>
>>103473198
>i think you need to study the chinese as a race
A person whose race is "Chinese" (Han) born in the US would be different from a person in costal China, and different from a Taiwanese.
Culturally they're very different, Taiwanese are more polite and make me think of Japanese people in a way, while mainland Chinese mentality is still scarred from the great leap forward and has less manners.
>>
>you think they're going to screw over their own people over a quick buck
>doesn't know about the 2008 Chinese milk scandal
>>
File: HunyuanVideo_00169.webm (1.11 MB, 960x544)
1.11 MB
1.11 MB WEBM
>>
File: HunyuanVideo_00168.webm (1.18 MB, 960x544)
1.18 MB
1.18 MB WEBM
hunyuan cant do nuclear explosions in the background well, but understands that there needs to be clouds/fire back there. would be a good thing to train a lora for
>>
>>103473276
true for everything, but as america looses it's identity, chinese-americans will cling to their china roots. taiwanese are like that because they've been richer for longer and as the mainland develops more and more the mainlanders will become more like them. the plan for taiwan annexation was long term, over the next 40-50 years the chinese government expected taiwan would be naturally incorporated as mainlanders reach cultural and economic parity, it would just sort of happen. it's america that goes over there, sails their aircraft carriers in the south china sea and creates all these problems that push china to be aggressive. the chinese are naturally very conflict avoidant.
>>
>>103473286
yes chinese are greedy, but they are also smart. they know there is no future with the us. the only people that would go along with a war against china are literally traitors, most likely gay, and would not enjoy the support the population.
>>
File: HunyuanVideo_00177.webm (1.62 MB, 960x544)
1.62 MB
1.62 MB WEBM
>>
>>103473314
I think is just the text encoder, llava is cucked so the prompt is may change too. That is unsafe anon kun gen other thing. Or but as an AI model I don't suport. etc, etc
They train a whole LLM just for this for some reason
>>
Anyone ever get this whe using torch compile setting? I've tried everything! reinstalled comfy, torch etc....

"HyVideoSampler

backend='inductor' raised:
FileExistsError: [WinError 183] Cannot create a file when that file already exists"
>>
>>103473552
>They train a whole LLM just for this for some reason
you talk about MLLM right? they're already using it on their API, I wonder why they haven't released it on huggingface yet
>>
>>103473910
I don't know anon, some kind of bureaucrat shit
>>
>>103469437
i think this is just for downloading the public domain images and the model will be open
but the examples they posted are just painterly slop any model can make
>>
>>103469708
>Model trained exclusively on public domain artwork that they're going to charge $10/month for
They said it'll be local and open sauce. Please work on your reading comprehension.
>>
>>103469437
That's the dataset explorer. A new model that knows artists names in this day and age is very welcome. The explorer might be useful to create artists loras. I still haven't found Flux Loras of many artists I like. I am too lazy.
>>
>>103469437
When you download the image it gives you three files
The image shows a painting of a group of people sitting around a tree, with a woman standing in the center holding something in her hand. There is a vessel in front of them, and in the background there are trees and a clear blue sky.

and a json that's too big. I don't have a clue about training loras for Flux, but I guess that is going to be useful.
>>
>>103473973
>A new model that knows artists names in this day and age is very welcome
That's not their goal, (I think) they're attempting to show that one can create a good foundational model with no knowledge of specific artists - then after the fact artists would create loras from their own work or work they got permission to use. Is the idea retarded? Perhaps. Will the base model be good? Perhaps. But, their candidate model is built on Lumina-Next so take that as you will.
I'm giving it a modicum of chance because it's the first of it's kind as far as I'm aware.
>>
img2vid bros, we cooking! Expect exciting news soon
>>
Bakermanii
>>
>>103474102
>Expect exciting news soon
yeah we know, Hunyuan will release their i2v model soon enough
>>
What happened to Black Forest Labs' video model?
>>
>>103473973
>charging $10/month for public domain images
>>
>>103474147
canned because too dangerous and unsafe
>>
File: 1705666118691358.png (308 KB, 640x660)
308 KB
308 KB PNG
>>103474151
OpenAI charges 200 dollars a month for public domain videos
>>
>>103473971
collective ldg IQ is single digit
>>
>>103473276
BFL niggas be like yo we dropping the sickest model ever fr fr fr and then disappeared
>>
>>103474165
I don't give a fuck what people say, I would Mira Murati
>>
>>103474165
OpenAI is copyright infringement all the way down anon
>>
>>103473286
>2008 Chinese milk scandal
are you implying that the west haven't gotten any scandal ever?
>>
>>103474151
Are you illiterate or which part of the chart was confusing
>>
File: 1727013336965395.png (1.17 MB, 1486x960)
1.17 MB
1.17 MB PNG
>>103474195
>I don't give a fuck what people say, I would Mira Murati
I don't think what you said is controvertial at all, she's pretty attractive
>>
>>103474173
gm
>>
>>103474200
>copyright infringement
yeah, I know, and they charge that 200 dollars a month, but the model doesn't say nigger so we're safe and that means they're the good guys!
>>
>>103474147
Black Forest Labs has no more investment. it's dead in the water since nobody can make anything useful without the teacher model
>>
>>103474217
>which part of the chart was confusing
All of it, please explain what they are charging for
>>
>>103474236
kekd
>>
>>103474195
Ehhh, for a Silicon Valley girl she's alright. If I had to choose between fucking Mira or Caroline Ellison, I'd choose Mira
>>
>>103474235
Aren't they collaborating with X?
>>
File: herndon.png (749 KB, 640x640)
749 KB
749 KB PNG
>>103469437
apologize
>>
>>103474208
Last I heard central planning has issued a directive and their primary schools are now highways, to support the population of course.
>>
>>103474261
Elon used BFL like a cheap whore
>>
>>103474261
how is that revenue for them when elon gets the majority of sub money? they have been made into elon's little bitch and have nothing more to offer since china will just train a new flux arch without censorship
>>
>>103474306
can it do downsyndrome?
>>
>>103474192
the tokenizer is not compatible. We could try a uncensored versions of llava
>>
>>103474316
yes, prompt emma watson
>>
>>103474261
do you see elon giving them any sort of mention on the grok image gen? they have been cucked to oblivion not to mention the model IS censored. can't even get proper nipples or vag at all
>>
>>103468004
id pay money for foot clips like that. where that genner at?
>>
bakersama?
>>
>be investors
>give SAI money for SD3
>dont get SD3, employees fucked it up
>the same employees come with their hand out, say they'll train SD3 properly for more money
>super pissed but ok its cheaper than giving SAI more money
>they train Flux (SD3), its ok
>give them to Elon to play around with then discard
If you demand more money to do something you've already been paid for it's the last money you'll get
>>
>>103474334
kek, it's funny because it's true
>>
>>103474373
>id pay money for foot clips like that.
you should pay a 24gb card instead and gen them by yourself
>>
>>103474147
I've learned to never trust an AI company when they say "soon". Release something or STFU.
>>
>>103474413
id also have to learn genning which i currently dont want to do
>>
>>103474444
you learn this shit once and then it's just writing waifus with close up feet and press generate, completly worth it
>>
>>103474055
I have started training with 45 images from the source.plus site, if this works I'll do N.C. Wyeth next, but first I had to do Waterhouse.
>>
>>103474444
literal indians living in tin shacks are doing this for money on fiver right now you have zero excuse
it's braindead easy to gen, the hard part was installing this gay shit whenever ((dependencies)) didnt feel like cooperating.
>>
>>103474444
quads of truth fuck imggen
>>
File: HunyuanVideo_00028.mp4 (447 KB, 544x960)
447 KB
447 KB MP4
>>103474373
I made it.
Don't pay for AI unless it's upgrading your own hardware to gen better (or renting a GPU I guess)
>>
>the only anon to really figure out hunyuan is also a pedo
>>
>>103471569
kyss faggot
>>
>another thread full of uncanny valley chink slop
i don't think anyone has figured it out, anon
>>
>>103474496
jej. had a good chuckle

okay
>>
>>103474541
of course, comes with the territory.
>only people to mod one of my favorite games is from a pedo forum
>pedos seem to be the only ones that can bring most jannies on this site to heel
>they seem to also pop up at random to save dead projects or topics too
their power level is intense.
>>
File deleted.
>>103474603
>one of my favorite games
what game
>>
File: onirism.png (553 KB, 949x382)
553 KB
553 KB PNG
>>103474659
Onirism, game so based it's banned from discussion on /v/.
>>
>>103474603
how do they do it?
>>
>>103474674
>Onirism, game so based it's banned from discussion on /v/.
why? it seems like a normal game
>>
File: 1711625664728108.png (945 KB, 1024x1024)
945 KB
945 KB PNG
>>103474573
the chinks has conquered this territory, it is Hunyuan general now, surrender
>>
>>103468004
BunnyAyumi Flux1 LoRA


https://huggingface.co/AiAF/bunnyAyumi_LoRA_Flux1

https://civitai.com/models/230084/bunnyayumi
>>
>>103474741
>>
>>103474526
>>103474736
I kneel to the Chinese and their feet models
>>
>>103474743
>>
>>103474699
long checkered history that boils down to the moderation of /v/ have it out for anything videogame related on the board that isn't obviously paid for or nintendo- related, but especially loli or furry adjacent, which gets instantly nuked. Doesn't help that discordfags ruin everything even for legitimate discussion so that gives them leeway to 404'ing threads.
Atlyss is the new FOTM especially for being a good game, but because it has a dangerously furry flavor to it, threads are getting nuked too.
>>
>>103474753
>>
>>103474757
>>
>>103474761
>>
File: example_fm15328gt.jpg (55 KB, 1024x1024)
55 KB
55 KB JPG
>>103474766
>>
File: example_y5ex0w4rb.jpg (73 KB, 1024x1024)
73 KB
73 KB JPG
>>103474769
>>
>>>/g/sdg
>>
>>103474756
or more like just like the pedo poster in this thread it always results in actual pedo content because it's a game of escalation.
>>
>>103474102
>>103474122
I thought it would be somewhere in 2025 as they need to retrain the model or something.
>>
>>103474837
yeah, they have to redo all the training all over again, I hope they'll release MLLM before that one though
>>
>>103474868
Chang! Those capitalist pigs are making fun of us!! :'(
>>
>>103474413
obviously generated would look perfect
>>
File: 1729018643666698.mp4 (571 KB, 960x544)
571 KB
571 KB MP4
>>103474736
>it is Hunyuan general now, surrender
>>
>>103474886
>they'll release MLLM
why?
>>
b-bake?
>>
>>103474944
China is obsessed with legs, nylon and feet, and I love them for that.
>>
>>103474945
>why?
because it's the official text encoder? We're playing with a duck tape rignt now
>>
>>103474953
you mean the one we're using makes the generation worse?
>>
>>103474956
of course, Hunyuan wasn't trained with llama3, when you use the API it understands the prompt much better than what we have now
>>
>>103474966
OK makes sense
I wonder why they don't release it
>>
>>103474953
that's a lateral step, I'd rather we get extra functionality before minor improvements
>>
>>103474989
it's not like they'll have to stop the training of the i2v video just by releasing MLLM, they already have that encoder ready, all they have to do is to put it on huggingface, literally takes 5 minutes
>>
>>103474984
>I wonder why they don't release it
I have no idea, someone made an issue about that, I hope they'll answer
https://github.com/Tencent/HunyuanVideo/issues/93
>>
File: 1200.png (1.4 MB, 1200x853)
1.4 MB
1.4 MB PNG
>>103475004
>>
>>103475004
is it even theirs to release?
they stole everything else, why not the text encoder too
>>
>>103475033
kek
>>
>>103475040
>is it even theirs to release?
Idk, they called it Hunyuan MLLM, the duck tape we have is definitely not theirs though, it's a llama3 model
https://github.com/Tencent/HunyuanVideo/blob/main/ckpts/README.md#download-text-encoder
>>
>>103474966
>when you use the API it understands the prompt much better
Did anyone try using the API enough to confirm this?
>>
File: 1715953578884438.png (837 KB, 2632x1456)
837 KB
837 KB PNG
>>103475054
>Idk, they called it Hunyuan MLLM
https://arxiv.org/pdf/2412.03603
when you look at their paper page 8:
>We have configured HunyuanVideo with a series of MLLMs [78, 17, 26] for different purposes
78 is about this:
>Hunyuan-large: An open-source moe model with 52 billion activated parameters by tencent, 2024
17 is about this:
>XTuner Contributors. Xtuner: A toolkit for efficiently fine-tuning llm. https://github.
com/InternLM/xtuner, 2023.
and 16 is about this:
>A family of large language models from glm-130b to glm-4 all tools, 2024
>>
>>103474966
>when you use the API it understands the prompt much better than what we have now
maybe it's because we're using the distilled model and the API has the real one?
>>
File: 1704655830558719.mp4 (301 KB, 960x544)
301 KB
301 KB MP4
>>
>>103475099
I remember an anon comparing the API with the local with the exact same settings (same seed, resolution, prompt, guidance...) and the API performed better indeed
>>
>>103475199
That was me but that was sample size of two and the only objective difference was that my attempt didn't spell out "WAKE UP" in the sand correctly.
We don't know how consistent proper spelling is and how much seed cherrypicking they had to do to spell it correctly.
>>
>>103475219
yeah but the difference is there though, but we can't blame all of it on the text encoder, could be fp8, could be the distillation, there's a lot of reasons to it
>>
>>103475172
>.mp4
oh shit, /ldg/ can take mp4 now? I remember having to convert them to webm to post videos here
>>
>>103475239
/ldg/ is about videos now
>>
Bake?
>>
If the collage doesn't move don't fuckin bother
>>
>>103475488
>>103475488
>>103475488
>>
I'm getting a new GPU soon all I need to know is can I generate feet with 16GB of VRAM
>>
>>103472252
Odd gender reveal.
>>
>>103475239
The whole site can.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.