[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


80b Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106706484

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
TELL ME NUNCHAKU
WHERE IS THAT WAN QUANT
WHERE IS THAT QWEN LORA SUPPORT
>>
>>106708345
>WHERE IS THAT QWEN LORA SUPPORT
here?
https://huggingface.co/nunchaku-tech/nunchaku-qwen-image-edit-2509
>>
Blessed thread of frenship
>>
File: 1736584904914097.png (1.93 MB, 1728x1344)
1.93 MB
1.93 MB PNG
>>106708106
Alright here's some (retarded) napkin math.

HDM is 340m params for $650.
Lumina2 and sd35m are both ~2.5b params. They both show strong capabilities at their size while suffering from training data and architecture problems, so an HDM-like model with its superior architecture and better training data could easily become local SOTA at 2.5b.

$650*2.5b/340m = $4779

HDM is undertrained according to the author, so let's double that budget. I suspect we hit severe diminishing returns with a budget >$10k. With current tech, we can get crowd (or richfag) funded local SOTA with <$10k. Keep in mind there are other optimizations that aren't used in HDM, so if one or two other 10x optimizations can be incorporated, we are in very good shape.

>>106708232
SDXL anime models like noob are the most fun by far.
I've mostly stopped non-anime SDXL models unless I need a controlnet or something. I still use SDXL for really big upscales since it has the tile controlnet and can fit the whole gen in memory while not taking too long. There are some impressive non-anime SDXL checkpoints out there, but the prompt comprehension issues and lack of art style knowledge hurt it significantly compared to Chroma.

>>106708267
what case/mobo do you use for this? I've been wondering if it's possible to fit a dual gpu setup in a mid tower these days
>>
File: ComfyUI_18499.png (2.43 MB, 972x1728)
2.43 MB
2.43 MB PNG
>>106706788
It got that soft/grubby SRPO look... they really need to show it doing something a little more impressive than stock photography. Maybe a complex interaction or something, I don't know.
>>
File: 1741778495766362.jpg (794 KB, 3072x2408)
794 KB
794 KB JPG
https://xcancel.com/SD_Tutorial/status/1970518843048272293#m
what's happening here? why SRPO is completly fucked on fp8?
>>
>>106708383
Sex with Jenny
>>
>>106708384
Maybe like 3-4 steps?
>>
File: 00112-3245483784.png (693 KB, 888x1008)
693 KB
693 KB PNG
>>
File: 1000010860.png (2.8 MB, 1728x1344)
2.8 MB
2.8 MB PNG
>>106708376
Where's the dataset coming from?
>>
File: interpolated_00064.png (1.18 MB, 720x1280)
1.18 MB
1.18 MB PNG
ÆÜGH Icum

>Error: Maximum file size allowed is 4MB
https://files.catbox.moe/f7x5jz.mp4
>>
>>106708429
ask chatgpt to make a script that compress your video to 4mb
>>
>>106708415
literally just use the danbooru api if you just want a booru model
>>
>>106708376
I have a full size case with a full sized motherboard with two GPU slots, I prefer extra space for easy moving than seeing how tight I can fit everything.
>>
File: 1747876613603820.png (3.61 MB, 1728x1344)
3.61 MB
3.61 MB PNG
>>106708415
catbox please?
for the anime side of things, a booru dataset augmented with NL captions (but not replacing the tags).
for the rest, desu, I don't see why something like LAION wouldn't work. it was fine for sd1.5. then augment it with some better captions, aesthetic selection.. use srpo or whatever too

>>106708486
damn... maybe I should give up on my dual gpu idea. don't really want a behemoth on my desk.
>>
>>106708413
All computer metrics are bunk in the scheme of generative models. The only real way to measure a model is promptability, level of censorship, breadth of conceptual and stylistic capacity, and ultimately distilled as "usability". Right now all the metrics are biased metrics essentially designed around making stock images but really if you want to see how shit aesthetics filtering is, just take 20k images from any booru and see which ones standard aesthetics metrics consider low quality.
>>
>>106708528
honestly i don't blame you for assuming gpu sizes either way, you really don't have a full idea of how huge or small gpus are until you actually get one in your hands.
then you totally forget once its been in your system for a year+.
dual gpu'ing is not for the faint of heart. or wallet for that matter.
>>
>>106708528
Why would you want a space heater on your desk? Two 4090s running full tilt even throttled gets quite toasty.
>>
File: grid-0002.jpg (2.21 MB, 7440x5376)
2.21 MB
2.21 MB JPG
trying to find a method to stop style swing but it's really bad in some cases especially at random seeds that tries to be irl during the first pass. I'm giving this model one last chance before dumpstering it
>>
>>106708358
Lora support anon, that is still being worked on. You can use model fine, though I think they messed up the lightning merge in that version
>>
>>106708429
why did that video need to be 11 seconds if its the same motion, REEETARD
>>
>>106708528
I use a basic old raidmax smilodon case from like 15-20 years ago and it fits modern GPUs fine
>>
having a danbooru data set with the tags grouped by subjects, background and interactions would make even SDXL based models exponentially better
we just need a powerful VLM like Gemini but uncensored
>>
File: 1744480545629914.png (2 MB, 1000x2044)
2 MB
2 MB PNG
>>106708703
nai seems to do something like that
https://docs.novelai.net/en/image/multiplecharacters

however, nai's implementation kind of looks like it's just calling on a regional prompting addon at least some of the time.

reminder that the forge regional prompting addon is able to generate region masks FROM THE PROMPT ITSELF, and we still don't have a comfyui equivalent even though this kicks ass:
https://github.com/hako-mikan/sd-webui-regional-prompter?tab=readme-ov-file#region-specification-by-prompt-experimental
>>
>>
>>106708799
how could you gen this absolute filth?
reported, filtered, snitched on, sent the batsignal
>>
>>106708827
I prompted black monolith lol
>>
Is there a straightforward way to get SD working on linux with an AMD card? I'm following the wiki installation and I kept running into issues
>>
>>106708799
sovl and kino
>>
>>106708799
That's the most disgusting thing i've ever seen on 4chan, and I'm an oldfag. You should be ashamed.
>>
>>106708415
>>106708528
NTA and also asking for catbox, thanks
>>
>>106708799
The best image posted in a long while
>>
File: IMG_2311.jpg (74 KB, 934x2000)
74 KB
74 KB JPG
>>106708328
I'm in the OP
>>
>gm
>>
File: file.png (76 KB, 558x845)
76 KB
76 KB PNG
I've set up Qwen Image Edit but it's maxing out my VRAM. I've tried launching Comfy with and without vram saving parameters. Is the Q8 model too big for a 3090 (24gb vram)
>>
File: IMG_2370.png (695 KB, 678x907)
695 KB
695 KB PNG
>>106708883
>>
>>106708883
..no? i have 16gb vram and can use q8 fine
>>
>>106708844
>make venv
>follow these instructions https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#amd-gpus-linux-only
if that doesn't work, you will have to tell us more. card, distro, UI you're attempting to use. these steps worked on my old rx 6800 and my 7900 xtx.

>>106708883
try the fp8 scaled version instead. ggufs have a broken implementation on some cards, I have a similar issue on my 7900 xtx. fp8 scaled is faster anyway.
>>
>>106708872
>>106708892
You're a real piece of shit debo, it's also obvious that you like to bring up old irrelevant drama from other threads to force some conflict.
>>
>>106708931
Can't get it to work on arch with a 9070
>>
damn nigbo wthelly
>>
>>106708931
>try the fp8 scaled version instead
It's not faster on a 3000; on 4000/5000 it is.
>>
>>106708944
https://en.wikipedia.org/wiki/Nigbo_language
>>
File: Chroma2k-test_00050_.jpg (818 KB, 1184x1552)
818 KB
818 KB JPG
>>
File: file.png (332 KB, 3225x693)
332 KB
332 KB PNG
>>106708931
>>106708883
Do I need to run that pytorch step if I followed the Auto Installation? I can try it but idk if it will do anything
>>
>2025
>finetuned sdxl is still the best local model for realism and anime images
when are we gonna get an unslopped 4b-5b model with a permissive license?
chroma is slow dogshit that looks bad
seedream is the only new model that looks good but it's NOT local
>>
>>106708964
Can we not do this ritual post?
>>
File: interpolated_00067.png (1.01 MB, 720x1280)
1.01 MB
1.01 MB PNG
>>106708650
based retard doesn't understand what the pingpong effect is (or that it's a setting in those nodes)

anyway, GAAAHHH THE OOM IS EVERY GEN NOW CUMFARTUI YOU'RE PISSING ON MY LEG AND TELLING ME IT'S RAINING!

https://files.catbox.moe/dyugav.mp4
>>
>>106708959
reading comprehension anon. that pytorch setup is for the AMD linux user, not you. you should try the fp8 model instead of q8.
>>
>>106708941
>>106708931
Nevermind, got it to work with the manual install
Don't know why I bothered with the pip comfy-cli
Thanks
>>
>>106708959
>>106708883
oh yeah and if you're ever OOMing during VAE operations, replace VAE encode/decode with TILED VAE encode/decode

>>106708999
nice
>>
>>106708844
I think I saw a new beta version of rocm pytorch released today, in theory getting that should make things really straightforward
>>
File: Chroma2k-test_00004_.jpg (877 KB, 1408x2064)
877 KB
877 KB JPG
>>
File: 00114-2146955441.png (1.59 MB, 1008x888)
1.59 MB
1.59 MB PNG
>>
File: Chroma2k-test_00007_.jpg (617 KB, 1408x2064)
617 KB
617 KB JPG
>>
File: Chroma2k-test_00008_.jpg (726 KB, 1408x2064)
726 KB
726 KB JPG
>>
>>106708772
>and we still don't have a comfyui equivalent
>https://github.com/asagi4/comfyui-prompt-control/blob/master/doc/regional_prompts.md
>>
>>106708772
sounds like DAAM -> Latent Couple
>>
File: 1753982932434490.png (635 KB, 1288x808)
635 KB
635 KB PNG
>>
>>106708883
no? I use q8 on a 4080 with 16gb. it should be fine.
>>
File: 00659-39345720325.jpg (1.41 MB, 2688x2688)
1.41 MB
1.41 MB JPG
>>106708772
Dude this is better
https://github.com/Haoming02/sd-forge-couple
>>
>>106709346
what's with the obsession with that random dude
>>
File: 00116-3716891349.png (1.03 MB, 1008x888)
1.03 MB
1.03 MB PNG
>>
>>106709357
It's hard work to get to lolcow status
>>
>>106709357
It's obviously cause he worked at blizzard duh. Real answer the dude went on a weird tirade against getting game developers to create offline versions of games when they EOS them.
>>
https://huggingface.co/nunchaku-tech/nunchaku-qwen-image-edit-2509
>>
File: 1751386497849884.png (978 KB, 1080x652)
978 KB
978 KB PNG
https://www.reddit.com/r/StableDiffusion/comments/1nravcc/nano_banana_vs_qwen_image_edit_2509/
damn the lightning lora really sucks
>>
>>106709355
NTA but thanks for reminding me of forge couple
would you pls catbox that image?
>>
>>106709409
Ahh..man. Coping they will make a newer lightning LoRA.
>>
>>106709452
I don't have it on this computer, this was during my laptop era
>>
>>106709465
i mean just any forge couple'd gen will do, but understandable
>>
File: 1742378170494040.png (1.05 MB, 616x1696)
1.05 MB
1.05 MB PNG
>>106709460
8step one works fine in general for qwen edit v2.
>>
>>106709472
I'm not doing anything special and I don't give catboxes. It's a long story that becomes evident whenever you see the schizo screech the name ran.
>>
File: Chroma2k-test_00012_.jpg (857 KB, 1408x2064)
857 KB
857 KB JPG
>>
File: WAN2.2_00068.mp4 (3.43 MB, 960x544)
3.43 MB
3.43 MB MP4
>>
>>106709481
ran is the schizo
>>
File: 1731674190436019.png (1.19 MB, 1040x1000)
1.19 MB
1.19 MB PNG
>>106709475
>>
>106709486
>time wasting post
More wheelchairs for you then
>>
>>106709492
thanks schizo (niggerjak)
>>
File: 1733593142484038.png (1.07 MB, 1040x1000)
1.07 MB
1.07 MB PNG
>>106709487
>>
>>106709481
oohhhhh careful everyone. this mans hides national secrets in his prompts/ eat shit fuckface
>>
>>106709481
alright whatev fair enough

>>106709492
>>106709504
THAT was not me by the way.
>>
>He's triggered again
Every day you seethe here is a win for me and the other anons that separated from your mental illness and won.
>>
>phone posting to not look schizo
>>
>>106709509
Well there's your proof, this guy has been seething at me for 3 years all because I told him to stop spamming the thread with slop worse than you see on /sdg/ today while trolling and messing with people. You can see examples of his poor handiwork in OP.
>>
>>106709510
>>106709521
>>>/g/sdg
>>
>>106709521
slopmeister
>>
>>106709510
>Every day you seethe here is a win for me
so that's your goal in life, to own the anonymous libs on 4chan?
>>
>>106709539
no it's to blogpost and share shitty diffusion gens
>>
>>106709517
He pulled this exact thing when a anon was making fun of him in his containment thread yesterday and does it so often it has no effect anymore. He's so autistic and ritualistic he does the same thing every day non stop
>>106709539
>seethes about me near daily even when I'm gone for months
I dunno what to tell you
>>
gtfo ranfagggg
>>
harumpfff!!! I am the most important poster here and everyone should listen to what I have to say AT ALL TIMES! totally NOT a schizo so stop bullying me!

t. ran
>>
With comfyui, how can I start genning an image with one checkpoint, then switch to a different checkpoint?
>>
File: sex.png (63 KB, 912x400)
63 KB
63 KB PNG
how do I voice sex with my computer?
Like is vibevoice censored?
>>
>>106709567
are u a homo?
>>
File: 00085-475584187.jpg (708 KB, 2480x2688)
708 KB
708 KB JPG
Since he's having a melty I'm going to do something he's too autistic to do without getting exposed and post a gen. Progress on taming Chroma seeing some improvement
>>
>>106709568
basically just make a workflow that does it? you can pass the unfinished latent to the next sampler
>>
>>106709575
The reference audio has to have sexy sounds in it, then increase CFG 1.7+ and lower steps between 10-15
>>
File: 1729672694736454.png (39 KB, 1066x259)
39 KB
39 KB PNG
>>106709552
>about me
who the fuck are you? smells like some insane main character syndrome
>>
>>106709568
link different model to next k-sampler?
>>
>recycling the same memes and projection
>why can everyone point me out
>why is the thread I dedicate 15 hours of my worthless life necro bumping not getting any post
Gets the almonds activated, he's in such a diminished state and he just comes back for more punishment. Last post for you just keep seething until your caretaker tells you to shut up
>>
File: 00117-644131837.png (1.3 MB, 888x1008)
1.3 MB
1.3 MB PNG
>>
>>106709600
got example ?
>>
File: WAN2.2_00073.mp4 (3.68 MB, 544x960)
3.68 MB
3.68 MB MP4
>>106709485
>>
>>106709350
>>106708909
How is it using 24gb VRAM if other people are using the same models on 16gb? >>106708959
I'd rather fix that than download the FP8 scaled one (which I can't find, I can only see the Q ones)
>>
Where's the qwen edit anon. Post workflow for drawing to realism pls, I fiddled with it a bunch yesterday but it's not even outputting semi-realistic anymore, just reprinting the same drawing. Someone save me from this hell, img2img with illustrious models are better at realism but dogshit in keeping the character consistent with the reference and always fuck eyes hands feet teeth and such
>>
>>106709592
>>106709610
It gives an error about multiplying dimensions
>>
File: american-psycho-card.gif (221 KB, 220x131)
221 KB
221 KB GIF
>>106708799
>>
Radiance is pretty cool but it makes my 4090 caps whine in a rhythm like an incoming text in the 2000s.
>>
File: WAN2.2_00075.mp4 (3.59 MB, 544x960)
3.59 MB
3.59 MB MP4
>>106709635
>>
>>106709568
Chain samplers?
First one with the first model, second one with the second model, pass the latents (and maybe upscale in between)
Note that the denoising of the second sampler needs to be below 1 to get meaningful results with this.
>>
>>106708376
>With current tech, we can get crowd (or richfag) funded local SOTA with <$10k.

agghh!!! you fool. don't do that. don't give me hope
>>
File: 00118-2739512918.png (1.37 MB, 888x1008)
1.37 MB
1.37 MB PNG
>>
File: WAN2.2_00078.mp4 (3.78 MB, 544x960)
3.78 MB
3.78 MB MP4
>>106709700
>>
>>106709783
balenciaga?
>>
>>106709756
>>106709624
nice gens. now go back to your general.
>>
>>106709674
isn't illustrastious sdxl? You are using sd1.5 workflow
>>
>>106709631
>>106709575
Just picked some random porn vid, would be better if I cleaned the audio but too lazy kek
https://files.catbox.moe/5mimvk.mp3
>>
File: WAN2.2_00081.mp4 (3.68 MB, 544x960)
3.68 MB
3.68 MB MP4
>>106709783
>>106709792
hehe no
>>
File: 00119-2203196135.png (1.41 MB, 888x1008)
1.41 MB
1.41 MB PNG
>>106709795
thanks
>>
>>106709674
1. You are mixing ancient model with newer one (Though this shouldn't necessarily cause multiplying dimensions error?)
2. 10 steps will just give shit results
3. Lower denoising otherwise what you are doing is 100% pointless
4. You are mixing a 1024p and 512p model
With all due respect you probably need to learn more before trying quirky stuff like this.
>>
>>106709783 >>106709700
nice effect. the things people will do when they get the ability to do whole scenes of 30s-2min or so
>>
>>106709709
I think thats what I'm doing, I have one sampler with the first model, then I link it to a second sampler with the second model

>>106709818
Sorry, I don't know what that means
>>
File: 1749110973333005.png (1.02 MB, 832x1248)
1.02 MB
1.02 MB PNG
the girl in image1 is wearing the outfit of the girl in image2. make the image realistic.

not bad
>>
>>106709834
increasing cfg works.
Does vibevoice have prompt guide?
>>
File: 92252609.mp4 (3.97 MB, 848x480)
3.97 MB
3.97 MB MP4
>>106709635
Neat.
>>
File: file.png (79 KB, 1014x843)
79 KB
79 KB PNG
>>106708931
Looks like you were basically right, it's not loading it as quantized (sorry if phrasing is wrong, I just wanna make funny image)

I found a link to the FP8 model in the workflow itself so I'll download that, but I wonder if Wan2.2 is having the same issue. Any ideas on how to make it use the Q8 model properly?
>>
>>106709925
haven't seen one if there is.
>>
File: 1730854689275005.png (1.18 MB, 832x1248)
1.18 MB
1.18 MB PNG
kek, used ivy from soulcalibur as the swap source:
>>
>>106708931
>fp8 scaled is faster anyway.
Not for hardware that lacks fp8 acceleration, which includes RDNA3.
Nice AYYYYYMD cope though.
>>
>>106709987
if the model was lewder, would she keep the whip?
>>
>>106710013
the image source didnt have much of the whip but with a lora you can do anything desu
>>
File: 1735644782959154.png (1.25 MB, 856x1216)
1.25 MB
1.25 MB PNG
>>
>>106709909

miss this be local diffusion thread
>>
File: 1728962776551370.png (1.02 MB, 856x1216)
1.02 MB
1.02 MB PNG
>>106710038
>>
File: 00120-1195684579.png (1.34 MB, 888x1008)
1.34 MB
1.34 MB PNG
>>
File: file.png (71 KB, 723x301)
71 KB
71 KB PNG
In wan2.2, I'm getting a "ModuleNotFoundError" for decord despite having it installed in the venv. I'm not well-versed in Python so I'm hoping one of you might know what's happening.
>>
>>106710021
can be quite hard to train. at least the more creative whip swings. pretty cool regardless
>>
Does your guys' GPU have a coil whine? My GPU whines when I generate AI images, AI chats, and when I Cycles GPU render in Blender. It's silent when I'm playing video games.
>>
>>106710080
ain't it python3 you need to call?
>>
>>106710080
Run pip check.
Also
>wan/bin/pip
I don't know what's up with that so your venv might be broken.
Try creating a venv (with uv or whatever), source venv/bin/activate and pip install -r requirements.txt. Then run python generate.py
>>
>>106710097
https://vocaroo.com/168rSoBGUnsk
>>
>>106710097
V-sync?
Try a game that runs on a few hundred something fps.
>>
>>106709795
he's tryna back up his only friend nigbo whilst being unemployed xd
>>
>>106710097
I definitely had some audio interference when I got my 3090, and either my hearing has got worse or it stopped
>>
>>106710103
this has to be 11/10 nu-/g/ bait this has to be this just has to be

>>106710080
use miniconda. fuck venvs use miniconda. oh nooo my console bloat fuck you python is all bloat its all shit everything you do on a computer should be disposable USE MINICONDA FOR CUDA SHIT FORGE WILL SAVE YOUR ASS
>>
>>106710113
That was it, didn't activate the venv. Got a different error now, but at least I can work with this. Thanks.
>>
what is the moderest wan2.2 workflow? Rentry talks about "old" wan2.2

also, where are goofs for wan2.2???
>>
>>106710150
>what is the moderest wan2.2 workflow? Rentry talks about "old" wan2.2
https://vb.lk/wan-2.2-comfyui-example-workflow-kijai-github

>also, where are goofs for wan2.2???
https://vb.lk/wan-2.2-gguf-huggingface

retarded zoomer faggot
>>
File: 1755194100845587.png (1.17 MB, 896x1160)
1.17 MB
1.17 MB PNG
the man in image1 is wearing the outfit of the man in image2 and is wearing a black fedora, and holding a katana. keep his expression the same.

literally edgemaster
>>
>>106710173
not clicking dat sheit
>>
>>106710173
MODS
>>
>>106708328
Not sure if that's the right thread to ask, but does anyone know how these AI covers get generated?
I'm pretty sure the common sites disallow commercial music remixes, and frankly nothing I've tried sounds as "good" as these.

https://www.youtube.com/watch?v=HIjdlLSOtvE
https://www.youtube.com/watch?v=PeBkXhfchb0
>>
>>106710173
>vb.lk
probably an ad but that's still pretty cool stuff

>>106710213
retard. dumbass. cumslurper. twobit piece of shit.
>>106710215
it's fine, i clicked it. it basically yolo guesses the end of the link to something that makes sense
>>
File: 1748611921979109.png (1.12 MB, 768x1360)
1.12 MB
1.12 MB PNG
lara croft if netflix didnt make it:
>>
File: 1758332339826230.png (878 KB, 1024x1024)
878 KB
878 KB PNG
>>
Anyone tried this Forge fork?
https://github.com/DenOfEquity/ersatzForge

I couldn't get it to run (python dependency), but the dev updates it daily since it's his personal build.
Apparently, this is the original repo that Panchovix based ReForge2 on, and this one is still actively developed.
>>
No I stick with comfyui
>>
>>106709329
I don't see where this generates the masks from the prompt
>>
>>106710325
Please. Please no more forks.
>>
File: 00122-2611780847.png (873 KB, 1008x888)
873 KB
873 KB PNG
>>
File: latest.jpg (96 KB, 633x758)
96 KB
96 KB JPG
WHY DID NOBODY WARN ME THAT I HAD TO PUT A FUCKING "SAVE IMAGE" NODE OR ELSE THE IMAGE IN COMFY GETS DELETED!?!?
>>
>he can't right click save on the preview node
>he can't see his temp folder
NGMI
>>
>>106710009
>>106708945
good to know, it does seem like rdna3 lacks fp8 acceleration (at least I can't find anything saying it does), though it handles fp8 just fine of course.

>>106709963
I would just use the fp8 scaled model for wan as well. I have never had a good experience with gguf, probably won't until I upgrade to UDNA in the future.

>>106710097
yeah lmao, it's not so bad on the xtx, but back when I was using the 6800 it straight up sounded like it was screaming in pain
>>
File: 1737574003235457.png (1.36 MB, 1360x768)
1.36 MB
1.36 MB PNG
man I love AI.
>>
>>106710423
it was slop bro, dont cry about it. you'll gen better
>>
>>106710433
HELP ME!
>>
>>106710423
they're still on your temp folder though?
ComfyUI\temp
>>
>>106710423
we honestly hate you
>>
>>106710423
You need another node for protest, nonie :3
>>
File: image.jpg (48 KB, 680x414)
48 KB
48 KB JPG
>>106710467
>the folder is empity
>>
>>106710463
RIGHT CLICK ON THE PIC IN THE PREVIEW NODE RETARD CLICK SAVE AS
>>
>>106710467
Wrong. He needs a node to enable the temp folder.
>>
File: 1744769913943205.png (959 KB, 824x1256)
959 KB
959 KB PNG
the anime girl is sitting at a computer typing. on her white crt monitor is the text "LDG" with a chibi version of Miku Hatsune below it.
>>
>>106709930
i'm cool
i'm hip
i'm with it
>>
>>106710441
>though it handles fp8 just fine of course.
fp8 has less quality than int8 for diffusion, scaled or not.
On Blackwell it is arguably worth is because it gets a big speed up with dedicated acceleration.
Without that you are just degrading quality without any speed boost.
I would try to get Q8 working if I were you.
>>
>>106708376
Who would in their right mind waste $10K just to entertain autists on /lmg/? It better makes some serious money.
t. got the funds
>>
File: 1739319102445965.png (1.19 MB, 824x1256)
1.19 MB
1.19 MB PNG
>>106710510
>>
VAE decode(IMAGE) ------->(IMAGE) Save Image

WHY THE HELL ISN'T MY IMAGE SAVING IN \ComfyUI\output!?!?!?!
>>
File: ComfyUI_03412_.mp4 (653 KB, 896x1152)
653 KB
653 KB MP4
>>106710556
I bought a RTX 600 pro and use it mostly for gens that get posted in here and elsewhere on 4chan (I'm trying to use it for professional reasons too but failing at that).
>>
>>106710589
Please stop shitting up the thread with woahjak spam, thanks.
>>
File: images.jpg (26 KB, 244x207)
26 KB
26 KB JPG
>>106710605
HELP ME HELP ME
WHY DON'T YOU HELP ME
I SWITCHED TO YOUR DAMN COMFYUI SIDE,
DON'T LEAVE ME ALONE IN THIS!
>>
>>106710604
>RTX 6000 pro
What's the use case of this for diffusion?
I thought it excelled at MOE LLMs (still overpriced though), and was very overpriced and mediocre for anything else.
>>
>HuMo & Chroma1-Radiance Native Support in ComfyUI
Uh where the comfy haters at?
>>
>>106710605
catjak faggot stfu
>>
>>106710604
Reminds me of this.
>>
File: antiai.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
File: werewolfgunslinger.jpg (613 KB, 2808x2136)
613 KB
613 KB JPG
>>
File: 00064-516587371.jpg (1.78 MB, 1792x2304)
1.78 MB
1.78 MB JPG
>>106710658
original cutscenes & art when
>>
>>106710658
Prompt?
>>
wan2.5 is api only right
>>
File: 00073-516587372.jpg (1.43 MB, 1792x2304)
1.43 MB
1.43 MB JPG
>>
File: images(1).jpg (26 KB, 227x222)
26 KB
26 KB JPG
NOW IT'S SAVING IMAGES FOR ME BUT I WANT TO CHANGE THE DIRECTORY, GET IT OUT OF COMFY
HOW THE HELL DO I REDIRECT THIS!?!?!?!
>>
>>106710739
symbolic link or --output-directory "folder/path" in startup arguments
>>
>>106710689
Neat
>>
File: 00091-516587371.jpg (1.5 MB, 1792x2304)
1.5 MB
1.5 MB JPG
>>
Reminder that with the upcoming Hunyuan Image 80b, if you do not have a 96gb vram card you are not allowed to discuss the model. >>106703161
If you cannot afford to upgrade to a 96gb+ card, you are a poorfaggot turdskinned larper who should stick with SaaS



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.