Discussion of Free and Open-Source Diffusion models.7 minutes is too long to wait for a single video gen EditionPrevious: >>103513104>UIMetastable: https://metastable.studioSwarmUI: https://github.com/mcmonkeyprojects/SwarmUIForge: https://github.com/lllyasviel/stable-diffusion-webui-forgereForge: https://github.com/Panchovix/stable-diffusion-webui-reForgeComfyUI: https://github.com/comfyanonymous/ComfyUIInvokeAI: https://github.com/invoke-ai/InvokeAI>Models, LoRAs, & Upscalershttps://civitai.comhttps://tensor.art/https://openmodeldb.info>Traininghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scripts>HunyuanVideoComfy: https://github.com/kijai/ComfyUI-HunyuanVideoWrapper/Windows: https://rentry.org/crhcqq54Training: https://github.com/tdrussell/diffusion-pipe>FluxForge Guide: https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1050ComfyUI Guide: https://comfyanonymous.github.io/ComfyUI_examples/fluxDeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main>MiscShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Generate Prompt from Image: https://huggingface.co/spaces/fancyfeast/joy-caption-alpha-twoArchived: https://rentry.org/sdg-linkSamplers: https://stable-diffusion-art.com/samplers/Open-Source Digital Art Software: https://krita.org/en/Txt2Img Plugin: https://kritaaidiffusion.com/Collagebaker: https://www.befunky.com/create/collage/Video Collagebaker: https://kdenlive.org/en/>Neighbo(u)rs>>>/aco/sdg>>>/aco/aivg>>>/b/degen>>>/c/kdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vt/vtai>Texting Neighbo(u)r>>>/g/lmg
Blessed thread of frenship
https://civitai.com/posts/10256589Holy shit...
>>103519758fuck, the workflow is only about the sound part, not on how he made those videos in the first place
>>103519758well it did better than i thought it would.
If there was a button that deleted all video tech I would press it
>>103519730yeah it's the same censorship too, so not the pony of this model yet
>>103519786good thing you don't have that power Stalin
ok so other than block swapping what are the big low vram optimizations I need to install to get my frames up?
>>103519796You need to install a new gpu.
>>103519758https://civitai.com/images/45265861this one actually looks good and not like some girl having epilepsy
>>103519796it's the biggest one, rest doesn't matterjust be very patient
>>103519807What did you prompt for that?I tried to make it generate a vr video but had no luck.
>>103519668where collage
>>103519811>just be very patientMy gen times are fast. I don't need speed, I need memory tricks.
>>103519796Install more vram https://youtu.be/14jzlR4yGCQ
>>103519824>video shot with a 180 degree fisheye lens
>>103519808>this one actually looks goodit's just a matter of luck, it's still too inconsistent, but the simple fact that it can do porn out of the box is already insane
>>103519887Man, just a couple of times is great, I hope he'll share the way he does it if it's not too schizo.I'm not interested on the audio, though it's a nice bonus.
>>103519887the fun thing about porn being one of the most plentiful datasets with all positions, styles and ideas possibles, and yet here we are trying to get it to do simple penetration
>>103519946It's possible that it didn't train on that many videos of porn, but instead was more images of porn so it doesn't quite understand the movement yet
so next SOTA will be a video, image, and audio model all in one, right?
>>103519946>we are trying to get it to do simple penetrationfor a base model that's already impressive, what other base model can do porn? none, Flux doesn't know what a penis is, SD3 can't even lie a woman on the ground lol
>>103519956Oh for sure it's that, it doesn't seem to understand penetration (and most positions) by default.So they probably avoided adding most actual videos of porn.
>>103519967img to videomore than 5s while staying coherentactual sexual stuff for both 3d and 2d
>>103519967>next SOTA will be a video, image, and audio model all in one, right?Hunyuan can do video, image and audio by itselfhttps://www.youtube.com/watch?v=6MISaOhNqmg
https://www.reddit.com/r/StableDiffusion/comments/1hedg7a/improving_hunyuanvideo_with_clip_finetune_factor/That guys always sounds insane but it looks like he knows what he's doing
>>103520061As long as it's not that Turkish lunatic I'd take a look.
>>103520061>But, I think by giving my SAE-CLIP finetune the right OVERLORD influence (top right), the result is best (happy to have you disagree, if you do!). tbqh his clip is the second worst
>>103520061>AI & I do prompt engineering towards prompt criticality.can't take this guy seriously.
>>103520125I shouldn't either, but I like his clip finetunes, they are definitely better than base clip_l, still using smooth to this day
>>103520061His versions are the worst there
>>103520136>they are definitely better than base clip_lI never switched to using it myself. From what I remember didn't look like much of an improvement so skipped over it.
>
So anyways
>pull>Get a cuda error during vae decode or the terminal just closesI will never learn
this is ithttps://civitai.com/models/1038199/nsfw-hunyuan-lora?modelVersionId=1164548
>>103520658Would be it if he trained it on irl and not 2d stuff.Hunyuan sucks at 2d.
>>103520705maybe it can work on irl stuff, I saw the workflow he uses "hentai" at the end of the prompt, you remove that and see if it makes it realistic
>>103520658Good. ACCELERATE. The lora era is HERE.
>>103520658What are the odds that we’ll be able to use both a motion Lora and a character Lora together? Asking for a friend of course.Although you might not need both if we’ll get a coherent image to video setup.
>>103520796>What are the odds that we’ll be able to use both a motion Lora and a character Lora together?I'd say good odds? stacking multiple loras has always worked on other models
>>103520658https://www.youtube.com/watch?v=Q929ZMezRvs>Mr Bones - There are no brains
>>103520855>brainsbrakes, fuck my brain. There are no brakes on this train at all. Just awaiting the fucking normie melt down. N-NO YOU C-CANT JUST MAKE AI VIDEO PORN!!! HOW HORRIFYING SOCIETY WILL SURELY COLLAPSE, THE FUCKING SKY IS FALLING.
>>103520885>Just awaiting the fucking normie melt downsame, where's the meltdown on hunyuan? this shit is as uncensored as it gets and no luddite seems to give a fuck, what's happening?
>>103520895ahahaha, i remember those idiots saying we would never have something like this back in the summer and i said "I give it till the end of the year and you will eat those words" I am an 80's kid and I've seen before how fast new tech moves once it gains interest.
>>103520920>i dont think anyone really knows about it desuI highly doubt that, it's third on the huggingface popularity list right now
>>103520934how do you think the normies get the AI info from? I guess from the media, but the media knows how to look at the new hot thing, they know about hunyuan there's no way they don't
>>103520915Same I remember saying the same to them and pointing out how things always chnage and how everyone always acts like everything stays the same, yet they laughed at the idea of local video lol
in the last 48 hours we just blew everything apart, down with the likes of pornhub and its scummy industry that created the problem, we are the fucking solution and the fucking absolute. We will crush that entire industry for exploiting us. Pretty soon everyone will be bored of porn because they can get their instant fix anytime they want all for free. Just you wait they will be kicking off about us anons :-)
it's getting really schizo in here
>>103520979why?
>>103520895>this shit is as uncensored as it gets and no luddite seems to give a fuckI hope it'll stay that way, so that it could give some balls to the other AI companies like SAI and show to them that it's ok to release uncensored models
>>103520966mate i've some ideas in my head to make an even better video model since using this thing and coming to realize how it works. A one that would work way fast and taking a much different approach, using only stable diffusion models like SDXL and a bit of math to work out differences between images and then using a sorting algo. This truly is the beginning, those normies better buckle up.
https://www.youtube.com/watch?v=n4RjJKxsamQ
>>103521002Sounds interesting :o, and yeah new ideas and techniques will always come too, so if one thing gets to a dead end and the door closes, another door always opens. People often calculate the future based on what we have today rather than what we might have in the future haha
>>103520658give who done this your buzz
>>103520915ok boomer
>>103521042true, it's weird when everything is going smoothly, it shouldn't be
>>103521038if we use 2 reference frames, one is the start frame and the other is the end frame, then we gen a lot of images from one prompt and then do some image difference math, then sort them in logical order, then we have our flow right? Ah still thinking about it, i dream in stable diffusion FFS...
>>103521038and yeah tagger models are getting rather good, it wouldn't take much, but would it be faster? I don't know, but with the likes of sdxl and pony we have lcm which is very fast. I have a pony lcm weights lora, it was the first ever its fairly recent it just needs to be used at a low weight like 0.2 and no negative and cfg 1 and its really fast.
>>103520986We're accelerating too fast, so the event horizon looks schizo. It's crazy because yesterday morning we had static cowgirl sex with no movement. Then, cowgirl with movement but fucking demonic spasms. Then a gen last night so peak that people were begging the anon for his workflow. Yet it's already obsoleted by new civitai loras. And these are literally alpha loras on an alpha lora training code on kijai's experimental (and constantly breaking) node for a base model that hasn't even finished development yet (they promised MLLM, i2v, more vram/gpu splitting improvements, etc etc etc). Oh, and we're using fp8 not a proper Q8 gguf quant.
>>103521133>We're accelerating too fast>a base model that hasn't even finished development yet (they promised MLLM, i2v, more vram/gpu splitting improvements, etc etc etc). Oh, and we're using fp8 not a proper Q8 gguf quant.that alone shows it's not accelerating fast enough, I to want a Q8 quant and MLLM, that alone will make the model much better overall
>>103521133The schizo, ladies and gentlemen.
>>103520658>>103520705>Would be it if he trained it on irl and not 2d stuff.it works well on irl stuff if you decrease its strength to something like 0.7 or 0.5
>>103521175might work, but might still feel like an animation made in blender hmmm
How do trains work
>>103521187well, he trained on 3d renders so yeah those are blender animations
>>103520973Nah, reddit, pornhub and the rest of them won big time. They managed to kill off the real amateur porn by nuking it all from orbit. Thousands of terabytes of the genuine stuff from 00s and early 10s, all gone. Replaced by the soulless shit from onlyfans and the others. Horny amateur sluts who were filmed for fun and gained absolutely nothing from it lost to the greedy professional whores, pissing themselves with dead eyes to justify your $5 subscription. I hate this. There's nothing good left out there to train an amateur video lora to recreate those feelings.
>>103521153Mllm ?
>>103521208The new generation by default has more of that dead eye look for some reason, something about social media gives them that boring expression.(of course not of everyone. bu it's something I've noticed)
>>103521133>begging the anon for his workflow. Yet it's already obsoleted by new civitai lorasaye but i've work on it hard to redeem myself, its being tested, it produces enough to feed the machine and with the lora on top it should be pretty fucking insane. but wait out lad, the future is rewriting the past right now that is how schizo crazy this shit is, this could be really it.
>>103521222We're not using the official text encoder, the official one is called HunyuanMLLM and they said they haven't released it yet, so for the moment we're actually playing with a duck tapehttps://github.com/Tencent/HunyuanVideo/blob/main/ckpts/README.md#download-text-encoder
>>103521250We don't know how big a different the official MMLM is over the one we have now.
>>103521265it's probably a finetune of llama-llava-8b, it can't be too different, for example joycon is also a finetune of L-L-8b and when you plug on hunyuan it gives you nonsensical outputs
>>103521153its moving rapidly in ways you could not comprehend, did you ever think we are already living inside of it now? Is that air you are breathing right now anon?
>>103521283I always knew, I just wonder who the real me is behind all this.
>>103521303one day I just imagine everything will burst into a surreal experience and the sky will open and we will unite into oneness that we will remember. That would be the singularity which is moving rapidly always backwards unraveling the past. What do you think this "car size drones" are? Iranian mothership who the fuck writes this shit kek? A media or a system that is scared of losing control. project looking glass ring a bell? They knew it was coming and there was no way to avoid it.
>>103521337There is a connection here, I was actually watching a video about the drones while reading this reply to me....
we local degen general now
>>103521247Obsoleted is probably too harsh of a term since it's still very useful. Keep working on it
Just so you know I all think less of your for jumping up and down like screeching baboons while ranting about the future being here because you saw a 3 second clip of a very wobbly penis going into a vagina.
>>103521412>I all think less of your*you
>>103521412>Just so you know I all think less of your for jumping up and down...Good morning sir
>>103521405firing the weapon now sir. first test of new method. I have the new lora ammo sitting on the platform, it will be loaded next and then fired.
>>103521412Pretty soon it will be our wobbly penises going inside those vaginas. The future is now old man.
>>103521412Your english is terrible SAAR
For anyone using Cubey's LoRA, I'm getting the best realistic results by combining with a photographic character LoRA and a prompt that emphasizes specific things like this:> nsfwsks, a girl is having (missionary sex:1.2) with man out of frame, (his penis is going deep in and out of her pussy:1.3), (repeated motion:1.3), hentai, (ohwa person:1.6), blonde hair, she is (lying on a bed:1.2), she is moaning, the camera is stationary
>>103521412>Just so you know I all think less of yourHow am I gonna recover from this :'(
>>103521430>Make one typo>Immediately demoted to street shitterInsanity.
>>103521432> (repeated motion:1.3), hentai, (ohwa person:1.6), you'll get better results by removing the "hentai" token first lol
>>103520930It's not going to hit normies yet until someone like Sarkas or Aitrepeneur cover it in one of their tutorial videos.
>>103521441>oneOne? Your whole sentense reeks of curry esl saar.
>>103521448Incorrect. It goes completely off the rails unless you include that since it's an important part of the training prompt.
>>103521432>(bob:1.2)>(vageen:1.8)These should not work with the current text encoder. I don't know what your reasoning is.
>>103521412it is a shame these generals have devolved into what they are now.
>>103521470>Incorrect.correct, I got good results by removing "hentai" as long as it follows the workflow's settings (640x480x49f)
I didn't ask for this
>>103521494that looks cool though
>>103521491Would one of you fucks post a catbox or I don't believe either of you
>>103521494Gnarly
>>103521515NTA but there is no way in hell that (word:1.2) does shit in hyvid
>>103521376>braaaaaapsquuuueeeepppbrbrbrbr!Not being funny that was the first thing when i read her expression.Goodnight
https://www.youtube.com/watch?v=1UUYjd2rjsE<3
>>103521580>ScorpionsThought /ldg/ were just fans of their one album cover, didn't know you liked the music too
>>103521580https://www.youtube.com/watch?v=oxZxe092eqo
>>103521529correct it does not understand that and also the negative cfg thing will oom your shit which is a real shame because it was useful. Oh well. Tbh i rarely use negative prompts these days anyway because they can negatively influence the image through restriction.
>>103521580>>103521629I don't want to sound rude anon but you lost a lot of credibility by making a schizo workflow and at the same time assuming that going for denoise 1 wouldn't completly destroy the input video in the first place
>>103521529LMAO you clearly haven't tried it, prompt weighting works just fine in Cunnyun
>>103521083oh man I love death grips
>>103521638i mistake i know anon, its not the same as other samplers 1 denoise in hunyaun is not the same. It will replace what every frame with random noise at 1 and then denoise that i realized my mistake after sleep (I can be up 48 hours no joke) and i disclosed that i was wrong but i learn a few things from it. As soon as i knew my mistake i notified them anons in the porn thread, then someone kindly linked it in previous thread on /g/ and label it meme workflow. So that is why am very busy now to make something that actually works to redeem myself for that failure and disappointment. In reality i know too much anon, and this anon will deliver soon.
>>103521680like i could put on first image at stage on an ipadater and an image loader for you to load your favorite and have the model make all frames of her face and body and style, but I know not to do that... It will be abused to fuck...
https://civitai.com/models/1038512/super-saiyan-hunyuan-video-lora?modelVersionId=1164936I'm gonna have so much fun with those loras, and it's just the begining
>>103521657The burden of proof is on you.
>>103521708>0 downloadsWould have been more honest if you said "Hey I made a LoRA, try it out."
>>>/aco/8643760frame rate is a little slow, can fix that, but here you go anons.
>>103521733lol, I have no idea how to make those things, I'm just on my "f5 spam" phase on civitai, like I did during the first loras of flux
>>103521740>here you go anonsand like the day before you didn't share a workflow
>>103521751its coming man you fuck head relax jesus, why be a bitch i have to make sure its right and works god damn. Your nastyness will not prevent me post it so fuck off fed, neck your self
>>103521740ootl what is this? some kind of hacked together i2v?
>>103521763you wasted everyone's time yesterday with your broken workflow, I think you should lower your motherfucking tone down, you'll be allowed to talk like a big boi once you'll show a functioning workflow
>>103521774something special that was i2v interpolated then refined and send in as reference, second stage is wack though i will post it to show in a few.
>>103521788>>103521784ahahah yeah what retards to think any one cares...
>>103521740and you think the one thing ai tech should be used on is the one thing the internet is overflown with: pornok
>>103521724nigger this isn't a court of law, i'm not the district attorney. try it or don't, but you're a smoothbrained promptlet if you don't use weighting with this thing
just scranned some beef stew out of tin, no time for meals. i will drop workflow as is but i'm still working on it. I will probably remove the old concept group because its not good enough, this is better.
or maybe i just fuck you and kept if for myself...
>>103521742No shame I'm in the same boat.
I'n all honesty i could care a less about you dickheads your all stupid cunts. pic related. I don't any of you contributing, except the lora guys that gave us nice things, all you lot are braindamage
>>103521937yes, and bottom right unironically the best and he completely ignores it
bye bye
>>103521946>drama queenIkr, he sounds like a chick, I won't be surprised he will troon out in a near future or something
all these fags spending multiple thousands to generate a few seconds of videya that my consumer card will be able to do in a few monthslol !
>>103522071What are you implying?
>>103522071> consumer card> 4gb vram
we truly post in a local diffusion thread
Is it (Sentence prompt that I want noted.):1.2Or (Sentence prompt that I want noted.:1.2)In comfyui with NTRMix and Illustrious models?
>>103522291Highlight the thing you want emphasized and press ctrl+ on your keyboard and it will do it in the correct format for you.
>>103522357Ctrl+ upFucking 4chan doesnt like the up arrow.
>https://github.com/ai-forever/Kandinsky-4and noone is talking about it. /ldg/ has fallen
>>103522557the video part is CogVideoX tier
>>103522584>10 secondsthis alone could mean something is there
>>103522613you can do 10 sec on hunyuan, if you have enough vram
>>103522584Why does the cat look so nervous?
>>103522621>if you have enough vramthis is the part that's shitty
>>103522584just like how llama-3-405b made mistral release mistral large, i feel like hunyuan made these guys release
>>103522706tummy
>>103522584why is there only one example of a human and it's just a headshot of a motorcyclist wearing a helmet. Can it do people?
>>103520658Cant seem to get a good gen using its keyword and something like girl having sex in cowgirl position. Any tips?
>>103522755uhhh sweaty you should be prompting older woman or mature not girl.
>>103521494GLORY TO THE PISSBIRD
>>103521494PRAY TO THE FALCON! RECEIVE GOLD DUST!
I'm seeing that people with 3060s are running HunyuanVideo. Is there a guide for how to get this running on linux? How much RAM do you need if you've only got a 12gb vram?
Is there a reason that the prompt is truncated to 256 tokens when the Llava model is supposed to handle up to 8192?
>>103522973unknown
>>103523283Don't fucking reply to me, pedo.
>>103522912I'm pretty close to the limit with 32gb, I could squeeze more out of it with block swapping if I had more, but it would be so fucking slow that I doubt it would be worth it
>every time i go to sleep then wake up it ACCELERATES
Working late tonight Agent Gonzalez?
>>103523340so you're saying there's no point in trying with a 3060?
>>103523525if you have one then what are you doing talking, go for itif you don't then I wouldn't recommend getting one for hunyuan
>125 seconds per step>450 steps (for a trial)that's... oh fuck.but hey, it's working
Seems like Hunyuan can handle 3 more seconds for total of 193 frames. But only at higher resolutions it seems. Not just skipping around, but a genuine smooth continuation of the previous frames. Catbox (nsfw) was the first gen with 10 steps flow=17, this is the 30step flow=7 version that is somehow no longer nude lol so I can post it directly. If you try this with lower resolution you get nightmare fuel.https://files.catbox.moe/1agjhn.mp4
>>103523416>this post got deleted toowas he samefagging and calling himself based?
>>103520796once we get image-to-view character video loras will become largely irrelevant.
>>103523442He was replying to himself saying shit like "your settings are so good, post them! I'm also genning cunny" and "based pedo, you're the best" lmfaoFeels like something weird is going on just because this one's from China. Someone wants it all shut down.
>>103523713What are the other settings/your hardware? I was getting 110seconds ish for 960x544@195f with some offloading on a 3090. We need to be able to split the vram between gpus so bad.
>>103523798yeah we established that like 10 threads agoits weird stuff cause its not like its subtle, and when enough anons point it out, they go nuclear and start randomly starting arguments with anons having normal conversations and doing the spiderman meme keklets just hope we can keep getting advancements and not end up with this shit suddenly shut down in 3 months or so.>i cant share a single thing ive been genning all month except something like picrel since its all explicit nsfw
>>103523820Why didn't you make a video out of this?
>>103523809I'm training a lora, I didn't manage to get this far yesterday and I'm probably gonna kill it after the first checkpoint
>>103523843>gtx 1080 ti
>>103521412>because you saw a 3 second clip of a very wobbly penis going into a vagina.this is like Neil Armstrong stepping on the moon
>>103521412>3 second clipWe've already have 8 seconds of coherent video confirmed. Possibly 10 seconds. And this is just the beginning.
>>103523867Neil Armstrong landed on the moon and then nothing much newer happened after that. Not a very auspicious example.
>Omg more porn!
>>103523820I wonder if we'll see another round of agitation about AI dangers/etc.
3.33 Eva seems fun.Using this atm: https://files.catbox.moe/3vr6k0.json
>>103524116>(Lick.)on par with picrel
>>103524116Oh, and I'm playing with just using some tfs instead of min p. Trying to find a balance of fun but smart even on contextless dumb stuff like this
>>103524126Meh, Im just trying to see how it writes / acts. Swear I've got actual quality shit elsewhere.
>>103524116ew
>>103524116>>103524126>>103524131>>103524138Wrong thread
>>103524147Recommend some normie card. Chub is full of hot garbage even if I sort by likes or whatever.
>>103524147i don't care *gives you a wedgie*
>>103523864
>>103523962Sex and war is the driving force of humanity. It's all these models will truly be made for, everything else is just derivative.
what's the skinny on local model (possibly diffusion? I dunno) trellis
>>103524168yeah but at least i can still gen in sdxl fine enoughat least.
>>103524116Based horse enjoyer.
>He goons to 3 second clips he spent 8 minutes generating
So hot
>>103524506More like 20 minutes. Can't wait until someone makes a Hunyuan Turbo XL of this garbage.
>>103524788>Turbowhy would you want to further lobotomize something that's already very SOTA and WIP?
>>103521250Thank you king!
>>103524865So bootiful :''(
>>103524506>>103524746Does anyone by any chance still have the Lora for the Bogs? These are so bloody good, can't stop fucking laughing.
>>103524928https://civitai.com/models/1035770?modelVersionId=1166218There you go
>>103524972What a fucking legend, thanks king.Something I'll look forward to when I get hang of comfyui more.
question, noobAI vs pony, nai seems to work well with just booru tags, but is it better at this point?seems to do backgrounds nice, this is without a rei lora for example.
>>103525154Yes.
>>103525161whats the diff between vpred and eps models? all i've read so far is eps is better for loras, apparently
>>103525168vpred supposedly has better contrast
>>103525176this is the latest vpred one (still learning settings/etc). so far im impressed, this is just a generic 1girl, frieren tag, classroom prompt. it's nice that there are good results even before character/style loras.
>>103525186same prompt except suzumiya haruhi:
>>103525192one more, just with mari booru tagless reliance on loras is nice BUT you still have the option to use character loras, or style loras. but artist styles work too, its a neat model.
bulma waving hello, no lorashirt is easily fixed with inpainting with flux fill or PS, still neat
>>103523727how much time to generate that?why is you flow so high?
>>103523978it never went away, every few days some journalist finds a new angle to to fuel the panic
I like the aesthetics but more importantly the model can do good backgrounds, that's my main gripe with pony based checkpoints. even without loras I like NAI so far.
How do you get less realistic more "perfect" skin? It's giving me blemishes instead of smoothed photoshoped perfection in higher steps in the videos I'm genning.
>>103525258your shit must be cursed because basically every video shown in this thread and on that porn general have girls, even the guys, with perfect skinyou should show the video and your settings so people can help
https://www.reddit.com/r/StableDiffusion/comments/1hen24r/comfyui_fluxmod_run_flux_with_88b_parameters/https://github.com/lodestone-rock/ComfyUI_FluxMod>A modulation layer addon for Flux that reduces model size to 8.8B parameters without significant quality loss.Interesting technique, if this can be used on Hunyuan we could be eating really good
>>103522557>and noone is talking about it.becaus they only released the bad version lol
>>103525121>>103525479Would you share the settings you used to train? Specially the amount of pics.
>>103525479>You're a wizard, Igor
>>10352549428 images of Igor and the other one. They were basically together in all the images.Tagged with joycaption but I don't know how necessary the tags even were.1100 steps for 40 epochs.Rank 32, but I think it would work just as well at 16tbhMy previous one was at 600 steps and it also functioned fairly well but tended to produce partially bogged subjects than fully bogged ones.
>>103525520>They were basically together in all the images.damn maybe that's why the lora is so fucked, might just try to crop igor out of every image and train on the crops. Could even use SDXL to touch it up slightly to a proper resolution even.
>>103525176Vpred's contrast is too high, and it generally seems to overcook the image to shit and back. This is epsilon11
>>103525526Fucked? I think it does what it was supposed to. Bog people.
>>103525536...and this is vpred with the same prompt and seed.I dunno, maybe it needs a different config or something
>>103525537>I think it does what it was supposed to. Bog people.well you certainly got me therejust figured it was still a WIP because you never re-recreated that one dracula gen since the initial gen was a failure with bogman looking like a still cutoutthose were hilarious by the way, the initial lora attempt just having them stand around like cardboard cutouts really awkward and uncanny.
>>103525571The first one was seriously overtrained on a dataset that was basically 4 4x4 grids of bog faces. It was never gonna work. Maybe I can get a Dracula out of this one, let me try.
yeah, i'm figuring out stuff like cfg and using the advised positive/negative prompts, but I can see the strengths of noobAI and illustrious right now.latest vpred (0.9) model:
>>103525608compared to pony, it seems to have better colors/shading, and NAI can do backgrounds (ponys main flaw, even if you can edit it later with a white background)
https://github.com/kijai/ComfyUI-HunyuanVideoWrapper/pull/149Thoughts?
>ponyAny "insider" info about the progress? Did he decide to drop the idea? It's 2025 in two weeks, v7 or v6.9 whatever should have been released in Summer to remain on top of the game.
>>103525520how long did that takeI haven't really fucked with wsl settings at all, but I was getting 125 seconds per iteration and cancelled it after the first epoch
>>103525616but it's amazing how well it works with generic booru tags. use this extension with it:https://github.com/DominikDoom/a1111-sd-webui-tagcompletethen get style loras or characters if you really need an obscure one. still, very impressed just with very little use of it. Using the default positive/negative prompts (like score_9 but for illustrious)Prompt Prefix:masterpiece, best quality, newest, absurdres, highres,Negative Prompt:worst quality, old, early, low quality, lowres, signature, username, logo, bad hands, mutated hands, mammal, anthro, furry, ambiguous form, feral, semi-anthrojust removed the nsfw negative prompt cause thats for me to decide. I love the colors, though.
>>103525645it's a meme, using a visual model to get the prompt from an image won't get you something even close to what you're trying to achieve with i2v
>>103525654>fantastic model>doesn't recognize unicorn overlord charactersSo close yet so far
>>103525653It should be like 6s/it on a 3090. Something is wrong.This was like 2 hours of training.>>103525669I figured at much, any experiments I've done using clip embeddings as a prompt in other projects never work.
>>103525645potential bigtime happening? I hope you hunnyan boys are running over 20gb of vram kek>>103525648eternally btfo'd by illustrious, then its grave pissed on by noobai. It's over for ponyfag.>someone please for the love of god make a ponyrealism equivalent for noob im dying here the chink bugs are the only ones doing it and they suck>picrel genned in ponyrealism
>>103525654*also, adetailer and controlnets work fine with this too, all the usual extensions work fine. civitai helper too for your loras/styles/etc.>>103525684thats where the loras come in. still, amazing model: it even knew misaki from NHK from the booru prompt.https://github.com/DominikDoom/a1111-sd-webui-tagcompleteI didnt make this (really) but it's so good for adding prompts with the exact tags (what the dataset was trained with).
>>103525701only thing I dont know yet is whether cfg 4 or 5 is ideal, it just recommends a range. I am a noobAI noob, ironically
>>103525689I know that it got btfo'd, switched myself since 0.1, but I wonder if there was any official word from the ponyfag. Or any observable butthurt, anything.>SDXL -> full anime/cartoon tune -> back to full realism tune pipelineUnironic mental illness.
>>103525701Yeah i've been using tag autocomplete since 1.5, it's great.Noob is also a great porn model. Very knowledgeable about interactions between two humans and with fantastic creativity.
>>103525686kek why does he look so confused, baffled, even terrified of the wine glass?its like he was having an existential crisis and thinking to himself "..what IS a man?!">>103525722>Unironic mental illness.well how else do you expect to get the creativity and robustness of a 2d model in the form of 3D/realism? a realism lora? LMAO
>>103525727He's supposed to be "sneering" but I guess he was too bogged.
>>103525725I did a nude prompt (not of rei) to test. it's very good, surprisingly good nips. What I like most is how many characters work for prompts without loras, now I can use loras for styles mainly or obscure characters.Is training illustrious based loras simple? it's SDXL based so does kohya work?
>>103525742>not of reiThat's good because if you did I'd have to report you to the FBI.
>>103525727I expect them to make an SDXL finetune in the same way pony/illustrious were made. By grabbing a ton of varied pics and following the recipe.Yeah maybe you won't get something extremely exotic but no way the general quality won't be higher. Though I wonder what kind of concept you can't find in 3d, that you still could portray in 3d by genning.
>>103525751nah, I test that stuff with something appropriate: igawa asagi....which the model knows without a lora, it basically knows everyone. Although I couldn't get noko shikanoko to work but deer girl anime is new, can use a lora for that.
What's with the "very awa" tag i see in many civitai noobai prompts? Some sort of quality tag?
it even knows umu with no lora. progress!>>103525775I think it's like pony's score_9, it picks the very top results or the best results, it's like a masterpiece prompt. the civitai page for the model has a list of recommended prompts but oddly the awa one isnt there.
>>103525787>isnt thereIt's on the HF page.
>>103525753this shit is nuts man, i do think you gotta be on the spectrum in some major form to be able to pull off any of thiswhich is why i hope next year i can jump on the train and do some of the work myself with a new system. I feel like we still haven't fully seen the potential of noob yet.
no lora, yep noobAI/illustrious is the way. oddly enough the model wasnt working in my other webui (for pony/sdxl) but everything is fine on a fresh reforge install/git clone.jack the ripper \(fate/apocrypha\), fate \(series\),masterpiece, ass, best quality, newest, absurdres, highres, very awa
>>103525787My nigga it knows even Fiorayne from Monster Hunter. Of course it knows mainstream characters.
>>103525820*im not sure if the awa prompt is necessary but it seems to work, so it may be a score_9 type of thing. it works, so ill just leave the basic prompts as default in ui-config.json.
aesthetics aside, the main benefit is that it seems more like anime and less "fake", the colors are nicer, the lineart seems better, and most importantly it can do backgrounds. Pony was SHIT at backgrounds. Good for nsfw/poses, bad at backgrounds.
Pony guy still going over licenses with his expensive lawyer trying to find the cheapest model to train on so he can scam for more while the rest of the world has moved on.
I thought bfl had changed the flux license terms months ago
>>103525846Experiment with different artists too. They change the output tremendously.
>>103525846also, controlnets seem better: here is a canny with saber as the prompt.very good model, it's a definite step forward.
>>103525884*with pony I always had to change it to "prompt more important" to get it to work. this works fine even with the default balanced.
>>103525884same image but with camilla \(fire emblem\) instead, works well: and again, no lora was used!
That's only one guy talking to himself, he always ends his sentence with a period "."
>>103525900what a time to be alive where we can generate bogs in any scenario.
>>103525915Sometimes the bogs turn Chinese and I don't know why.
>>103525952with this and ai audio the possibilities are endless.speaking of which, I can get perfect Trump audio with e2-f5-tts, it's amazing for natural sounding speech with a small sample.
>>103525960I think there are packages out there that let you puppet and reanimate faces. I forget the name. But you could easily have the moving clips speak in any way you want them to and it looks fairly seamless.
>>103525915>>103525960>>103525970>>103525952see? he always ends with periods >>103525914now I'm starting to think if this is a bot instead
>>103525952>Chineselooks like Taiwanese.. taipei 101 in the background. God damn I miss those chicks.
>>103525973You're a fucking schizo. You're supposed to end a sentence with a period. Generating bogs and posting them while I do work stuff doesn't make me a bot, retard.
>>103525983>writing well on 4chanlol? lmao even?
>>103525983kek dont respond to 'em boganon, it's probably the same fed fag that posts the pedo shit and tries to start the infighting.i wonder what igor in tiananmen square would look like.. i wonder if it'd make him even more chinese than this >>103525952
>lmao xd u use capitalization and full stops? liek ru a fggit or wat?
>>103525997>capitalization and full stopsyanked brit or a lime'd yank?
Perhaps xi was already kind of bogged?
>>103525988It's automatic. Imagine writing 10–20 emails a day that must be grammatically perfect. You get used to it.
>accusing a random person of starting infighting for no reasona little ironic dont you think
>>103526049>Imagine writing 10–20 emails a day that must be grammatically perfect.I work as an engineer and I let chatgpt do the faggot e-mail writing for me kek
>A man dances at Tiananmen square in front a tank.
>>103526098>those chinks haven't even censured tianamen squarehow based are they seriously?
>>103526110It's a psyop to get our vram occupied with their video model to stunt any research in other areas.
>>103526110tiananmen square is a real place that has tank parades all the time anon, just because they ran over some protestors skulls with tanks one time 30 years ago doesn't mean it's illegal to suddenly generate a tank in tianmen square
>>103526127they could've trained the model to shit itself when you associate the tank with tianmen square
>>103526135>trained the model to shit itself when you associate the tank with tianmen squareThat sounds like a very difficult thing to do.
>>103526050how do you get this shiny lookoily?
lol
hey asshole >>103521168, that's my webbumshow yourself coward
>>103526160wow file a dmca
>>103526146something about "oiled glistening skin" will get you therefor that one it was "her shiny skin is oiled and glistening"
>>103526160you can't act like that, to make an AI model you must train it with millions of images/videos of people and not a single time you asked for your permission, that's hypocritical
>>103526061I hope you're not sending those emails to other engineers because it's dead obvious and everyone will hate you for wasting their time.Just send the prompt as the email. The only thing chatgpt does is inflate the word count.
>>103526186*for their permission
>>103526186the model is under a permissive license, my image isn'tsee you in court
>>103526209Anon is so fucked. He's really kicked the hornet's nest.
paper mario noobAI lora, neat how it works with various prompts:
>>103526225
>>103526236
>>103526140I'm sure it's possible.
>>103526209>the model is under a permissive license, my image isn'tand the images used during the training, were they under a permissive license?
>>103526270objection, irrelevanceIf tencent stole media without informing their customers then that would make their customers victims of fraud
>>103526295>that would make their customers victims of fraudthat mean we could also sue tencent? kek, gimme that moneyyy
>>103526295>without informing their customers*laughs in eula*
I noticed that the lower the resolution we use on hunyuan, the more zoomed in the outputs gets, as if when they trained the model with lower resolution, they simply cropped HD videos to fit into smaller resolutions, I hope it's not the case that would be retarded
>>103526326from what i know about ML there is a moderate to high chance this is the case
>>103526171thanks anon
>>103526347arachnophobia jump scare warning
>>103526347>dark souls 1.mp4
flipping through the license and apparently you're not allowed to use it if you're from the UK, EU or south koreaalso this travesty of English>You will defend, indemnify and hold harmless Us from and against any claim by any Third Party arising out of or related to Your or the Third Party’s use or distribution of the Tencent Hunyuan Works.which I think means you'll keep your mouth shut about copyright
>>103526401>you're not allowed to use it if you're from the UK, EU or south koreakek
>>103526401
>>103526401>flipping through the license
>>103526401>you're not allowed to use it if you're from the UKOI! YOU GOT A LOICENCE FOR THAT M8?
>>103526401why do you subject yourself to this meaningless unenforceable stuff
nice view!but seriously, this model does colors and lineart a lot better than pony imo, and backgrounds. better prompt understanding and a huge character set based on booru tags, so not as much reliance on loras. but you can still use them for styles and characters.
For hunyuang lora trainers are you guys captioning your datasets at all? I'm gonna train a lora tonight and have a quite large image set that's already captioned for an anime model (illustrious). Should I run them all through a captioner to convert them to natural language? Should I just leave them uncaptioned?If I get no response I guess I'll start with converting to natural language.
>>103523727Interesting, I would like to see an example that doesn't feel like slow motion, I wonder if that's how it worked, a video that could fit in 5 seconds extended with slower pace.
hunyuan I sometime get slow motion or super fast and I still don't get why
>>103526505it's just so vibrant compared to pony stuff (in general), I thought it was a vae issue but nope.
>>103523820This image was weird, the thumbnail made me think it was Taylor Swift, but then I clicked on it and it was someone else.
>>103526525I think it depends on the numbers of frames you put
>>103526513I can't definitely say whether or not captions hurt of help I've tried with and without and got more or less what I wanted. Just like flux. My anime experiments have no turned out well though and I don't know if that's just bad data or the model just doesn't play well with anime.
>>103526535I always use 97
>>103526526but the best part, it can do actual backgrounds, which pony can't really do.
>>103526547the number of frames recommanded by tencent is 129, that's the number the model is the most used to
>>103526533you just got A.I psyopped
>>103526543Ok, thanks! I'll try converting it to natural language, one issue is there's several characters in my dataset so I assume I need to give it something to identify them with. Hope it's not a disaster.
2b \(nier:automata\) with "casual clothes" in the forest, it even got the mole right, which pony loras would fuck up (and require inpainting).very good model.
>>103526587>there's several characters in my datasetThat always turned into a problem for flux, prays it works out for you but I wouldn't hold my breath.
>>103526599We all know.
lmao2b \(nier:automata\), mexican sombrero, casual clothes, masterpiece, best quality, newest, absurdres, highres, very awa, smile, mexico borderjust wanted to test an out there prompt. SOUL
wow, I never got an alice this good even with a pony nikke lora. genuinely impressed at the base model.
>1girls posted once more on /ldg/are we finally back?
>1girls spam are backIt's over...
So either way we're at slops-per-second.Sovl when?
>>103526734there is video too which is good, im just testing noobAI. it's legit a step above the other stuff and a new model was released recently. (vpred 09)we're getting to new levels both in static images and video generation.
>>103526756>it's legit a step above the other stufftrue, I've tested pony and noob finetunes back and forth, and illustrious/noob really has more going for it
>>103526756>it's legit a step above the other stuff and a new model was released recently. (vpred 09)some people feels it's a downgrade
>>103526766even if you ignore everything else, the fact that it can do backgrounds is a step above what pony finetunes could do. and I like the autismmix model, but the main gripe is you make nice model, meh background/scenery. so you'd have to chop the model out and gen a good background with something else.this for example is just a simple prompt with beach and jeanne d'arc alter \(fate\).
>>103526782some say the epsilon model is more lora friendly but ive had no issues, just have 2 models or whatever you prefer, ive had some good gens with the latest one and ill prob try others.
>>103526782People like this are just retarded consumers. Ignore and keep building.
>>103526782My guess would be it's even more annoying to tardwrangle in it's vanilla form. Just needs some finetune love I bet.
>Windows: https://rentry.org/crhcqq54shit guide doesn't work with python 3.12
>>103526886what error you got?
>>103526886Yes it does. Where are you tripping up?
>>103526917Do this, but pan it up to reveal a bog face. Or better yet furk.
>>103526928>Do this, but pan it up to reveal a bog face. Or better yet furk.you want that reaction kekhttps://www.youtube.com/watch?v=34gmbdmn3Gc
>>103526899AttributeError: module 'pkgutil' has no attribute 'ImpImporter'. Did you mean: 'zipimporter'?my machines dependencies could just be irreversibly fucked, i'll restart everything from an anaconda environment
>>103526959I'm so glad I venved the whole thing
>>103526959oh damn, this is what Claude told me
>>103526959>>103526993you get more infos herehttps://stackoverflow.com/questions/77364550/attributeerror-module-pkgutil-has-no-attribute-impimporter-did-you-mean
Any word from furk on a LoRA training post?
>>103526993>>103527001I think the best solution is to just create an environment with python 3.11 and roll from there
>>103526993I had zero problem installing sageattention and I'm using python 3.12.3
>>103527009What I want to know is what the fuck does he mean when he constantly spams "fully Fine Tune / DreamBooth" is it one or the other?
>>103527063>3.12.3dunno what version ComfyUi has, I'm still on 3.11.9, I thought the 3.12 version of ComfyUi was 3.12.7
bakerman
>>103527064I doubt he even knows.
>>103527134top kek
>>103526928This is why I always need to see the face, like if you give us a feet video I need to see who it belongs to!
>>103527064Schrodinger's LORA
someone post a list of all dependencies and version numbers for a working hunyan please
>>103527423>>103527423>>103527423
>>103526600Hmm I had mild success with it on flux. With SDXL it was horrible, basically found it impossible to do multi character loras in SDXL, had to rely on finetunes instead.