[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: collage.jpg (2.25 MB, 4001x3083)
2.25 MB
2.25 MB JPG
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106924532

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106929651
STINKY
>>
*brap*
>>
>>106929646
If you can't tell the difference in detail in that comparison then you are blind.
>>
File: 00001-1360739014.png (467 KB, 512x640)
467 KB
467 KB PNG
>>
>made it in the collage
genning bros... we won!!!
>>
>>106929682
post a comparison of something hard like a group of people smiling at a festival fighting with swords instead of a generic tranime style that is seed variability tier difference between the images
>>
>>106929699
It's the same seed retard. And tell me HD Flash looks anything close in quality to the others. Go on.
>>
>>106929708
>It's the same seed retard
yes retard, thats why i said "seed variability tier difference" and not literal seed difference

tranimeretards prove once again they are subhuman TIER low IQ
>>
>>106929711
>posted from my 720p TN panel from 2005!
>>
Blessed thread of frenship
>>
File: file.png (107 KB, 1724x686)
107 KB
107 KB PNG
so why is everyone using wan again now? is something new out?
>>
>>106929699
Tbh, those are two people engaged in somewhat out-of-distribution activity, so the test is pretty valid. But with chroma it's almost impossible to change the aesthetics if these slice of life scenes. They are always 'crappy comic' style.
>>
>>106929771
They aren't done yet with that very lora you've posted.
>>
>>106929611
>>106929624
>>106929636

Followup on rouwei for >>106928168
>>
sirs?
https://huggingface.co/lightx2v/Wan2.2-Distill-Models/tree/main
>>
>take pov picture of my cock while sitting
>twilight sparkle \(mlp\), anthro, face focus, kneeling, looking at viewer, open mouth
>use photoshop to put twilight between myself and the background
>wan i2v with oral insertion lora
I'm obviously not going to post results but they are very good and it's going to last me at least a few more days.
Try it with your favorite 1girl.
>>
>>106929778
just hopping on for my daily ldg session. i think i get what you mean, but do you have a gen or two to catbox as examples?
i get tunnel-prompting trying to chase the ghost of a particularly good seeds and it stops me from exploring ideas. helps me a lot to see what others are cooking with.
>>
File: 1533897304390.jpg (54 KB, 500x500)
54 KB
54 KB JPG
>>106929819
>spend time and resources doing a thing
>"model card empty"
>>
how do I gen wan i2v for more than 5 secs
>>
>>106929819
its just light2x baked in the wan checkpoint. i dont know why anyone would use that over a regular lora though.
>>
>>106929819
Essentially the same as running lightx2v, is it not? If so, there's zero point in using this over the full models with the LoRA's.
>>
>>106929941
> its just light2x baked in the wan checkpoint
retard
>>
>>106929936
It can't do more than 101 frames without looping. Any more, and you start to see it wanting to return a character/scene to the position it was in in the first frame. Every method to go beyond that has serious drawbacks, except for Stable Video Infinity it seems, but that's not out yet for 2.2 and nobody gives a shit enough about 2.1 to port it to comfy.
>>
File: chroma___0001.png (1.64 MB, 896x1152)
1.64 MB
1.64 MB PNG
>>106929823
this nigga need a wellness check
>>
>>106929956
rife doesnt have any drawbacks, but it's only 7 seconds. those 2 extra seconds are worth it though
>>
>>106929960
Romans 1:26-27.
>>
>>106929971
You mean RifleXRoPE? That even work with 2.2?
>>
>>106929823
>ponyfaggot
unironically kys faggot
>>
>>106929823
based
>>
File: context.jpg (22 KB, 514x269)
22 KB
22 KB JPG
>>106929936
Wan context window nodes. I just set length to the amount of frames, set frames for the context node and i2v node to 165 for 10 secs. Be sure you have some kind of strong motion lora otherwise movement will be stiff
>>
>>106930008
Could you share a workflow or image of it? Nobody's shown a working workflow with these. There's two models, so how does that work? Which one do you plug them into?
>>
File: 2025-10-18.png (3.96 MB, 2688x1252)
3.96 MB
3.96 MB PNG
>>106929800
Yeah I tried yours and they don't work but these do. Therefore you can't say it loses ALL the artists. And after the required training to merge the shit properly it will work as before, only better. https://files.catbox.moe/bey9ol.png
HO
PE
>inb4 no sysprompt
>>
File: dmmg_0008.png (1.42 MB, 896x1152)
1.42 MB
1.42 MB PNG
>>106929675
>>
>>106929978
yes
>>
>>106930037
model?
>>
>>106929936
just increase the number of frames
>>106929956
I can do 161 frames without looping in some contexes. for example, I can do a girl lying on her back and a man enters, sticks his dick in her pussy and fucks her for the rest of the video without leaving. it's hit or miss sometimes though. if you pan or zoom the camera or if there's any significant movement or change in position of the subjects then it will 99% of the time loop
>>
>>106930008
I thought context windows was only for T2I? It was explained that due to the way it works, it's impossible for it to extend I2V videos.
>>
>>106929800
>>106930037
Hello, yesterday I did this tests >>106921060 of 2 characters fighting using RouWei Gemma variants Neta Lumina and regular WAI15. Not base RouWei.
Gemma variants had the worst tag bleeding and would not follow instructions even with Mithly github formatting. They were slightly more creative with camera angles and expressions but then I went with WAI15 without regional prompting.And for my perspective WAI15 was more solid with less tag bleeding than RouWei.
Tested Neta Lumina also with multiple prompt styles, prompt book settings, tags, short captions, artist tags. Still underbaked.
Got the workflows saved if anyone wants to test the Gemma variants themselves. WAI15 is better at following instructions for this use case in my experience.
>>
>>106930066
It does only work with t2v, which is why nobody can show a working example for i2v, they're just trolling
>>
>>106930078
i don't see the purpose of trolling when an anon is geniually asking for help. it just spreads misinformation for no reason. I think anon legit thinks it's working for them when in reality it's doing nothing.
>>
>>106930066
>>106930078
It works, it just repeats the movement
>>
>>106930055
Metadata catboxed. Unless I used the wrong one and not the one op was talking about.
>>106930071
Both Rouwei with frankencoder and any Lumina version are way too raw, sadly. At least in case of the former I'm absolutely sure it will be improved and brought to completion. The latter, unlikely.
>>
File: 1740940526467928.jpg (31 KB, 589x520)
31 KB
31 KB JPG
>>106930106
>you cannot extend i2v because it loops the video
>it works, it just loops the video
>>
>>106930106
>It works, it just repeats the movement
REALLY NIGGA?
>>
Please sir... just a crumb of Stable Video Infinity... a single rock scattered in the carpet is enough...
>>
>>106930065
Yeah its pretty good for repetitive movement, anything more than its just looping. There's super vace tools that's ok, probably the best "seamless" nodes and workflows I've seen https://github.com/bbaudio-2025/ComfyUI-SuperUltimateVaceTools
>>
>>106930150
I'm honestly surprised some nerd hasn't made a node for it yet even if it's only for 2.1 currently
>>
File: 00014-652290981.png (1.23 MB, 1344x768)
1.23 MB
1.23 MB PNG
>>
>>106930165
I always wonder why the dinks who make all this shit don't just use comfy by default, they release some sweet new tech gated behind python venvs and shitty gradio interfaces, so you can't use any of the currently existing optimizations without manually inserting the code.
>>
>>106930195
diffusers is the most up to date. comfy just leeches off of that
>>
>>106930195
why didn't comfy just make PRs to differs to make it less cancerous and just use that?
>>
>>106930195
Because they have access to H100s. They are doing research, not a product.
>>
>>106930195
why don't they just give C libs with bindings to whatever lang you like?
>>
File: wan22___0152.png (1.77 MB, 832x1216)
1.77 MB
1.77 MB PNG
>>106930195
these guys are good with python and diffusers and models, they don't need to be learning comfy that may or may not be around in a few years, obseleting their research
>>
>>106930221
Because C is hard. People can't figure out memory and stacks. Spaghetti GUI is the way deal with reality: slow it down, make it pretty, engage the user without empowering them and make it clicky. They think they're accomplishing shit..
>>
>>106930231
arty. me likes the aesthetic.
>>
>>106930231
this. comfy is a shitty application that was rushed out for a quick buck. it's plugin tier/ prototype tier forever and shouldn't be put on a pedestal as the professional implementation.
>>
>>106930244
too bad it's shit at doing that. the ecosystem sucks, the redundancy sucks and the front end keeps getting worse. fuck comfy for hiring chinks and jeets. everything is so shit compared to a year ago
>>
File deleted.
>>106930253
thx anon

>>106930255
i like comfy but i'm a programmer so it makes sense to me and i like that i can just modify the nodes how i want to. it's just open source and proprietary, building on top of standard tools is how research is done
>>
>>106930340
>makes sense to me and i like that i can just modify the nodes how i want to.
the problem is you can't modify nodes set in stone. you are a programmer so you should see the glaring issue of redundant nodes definitions. the arch is all wrong for scaling non destructively, depenancies can break the whole project if a custom node changes any version, the front end is slow garbage and getting into bad UI/UX practices and the web tech stack is just horrendously wasteful for something that should just be a C exe. this is the glaring issues of comfyui and most can never be improved.
>>
RELEASE THE 2.2 LOW NOISE I2V V2 ALREADY
>>
File: Video_00066.mp4 (890 KB, 480x704)
890 KB
890 KB MP4
>>106930397
i can recognize that these are issues of all open source projects especially when you extend that to community add-ons.

if you only use the core app with the core provided modules then you limit functionality, if you open it up then you get multiple people using multiple dependencies and no opinion as to what is bloat and what is required. you are arguing in favor of a closed source product that would be free yet provide all functionality you could ever want. Development would slow to a crawl and we wouldn't have things like a 24hr turn around new models drop. C is not the right language for this at all. but i see your opinion.
>>
>>106930424
>when you are drunk as fuck and go to the fridge to get a snack but the fridge starts dancing
>>
File: chroma_flux__0039.png (1.02 MB, 832x1216)
1.02 MB
1.02 MB PNG
>>106930450
rofl
>>
>mysterious anon that demands a c binary randomly is back
Oh so we know what comes next
>>
>>106930506
Rust static binary?
>>
>>106929956
good to know there's no way to do that
>>
>unironically using WAI
>a slop mix
>
>>
>>106930723
we have retards here suggesting 'wan smooth mix', so everything goes
>>
>106930723
>we have retards here suggesting 'wan smooth mix'
still? fucking kek
>>
>>106930751
my b >>106930736
>>
Finally installed "real" Comfy in a venv instead of portable, went with python 3.12 instead of portable's shitty 3.13, which has incompatibilities with certain extensions. All the issues I had with my portable install are gone. Totally worth it.
Thanks for reading my blog entry.
>>
>not using 3.14
grandpa getting left behind again
>>
>>106930804
nta but 3.12 has extensive compatibility with any and every node you can think of, everything past that has spottier compatibility with no benefit to the 'upgrade'
>>
quote me on this all ai video is still garbage
>>
>>106929651
Why is AniStudio not in OP?
>>
>>106930828
https://vocaroo.com/1kyQwNQy9lYo
>>
>>106930424
>if you open it up then you get multiple people using multiple dependencies and no opinion as to what is bloat and what is required
this isn't a problem with shared binaries. they can have conflicting dependencies and still work together. you aren't a very good programmer if all you think of is python which is such a shitty language to begin with
>>
>>106930826
keep coping grandpa
>>
>>106930828
There's only a few good use cases
The anon who posted the rouge twerking videos one of the last threads had the right idea
>>
>>106930424
>C is not the right language for this at all.
this is such a a brainlets take
>>
>>106930841
https://vocaroo.com/1fz4FNaSKnDv
>>
>>106930832
>AniStudio not in OP?
Get a notable mention on sd.cpp first. gz on 30 whole stars!
>>
>>106930973
just pr it for ani. every other one added themselves
>>
1girl, huge bazobaboobas, gargantuan titties, incredible masterpiece naked woman HAVING FUCKING SEX WITH A 1BOY
>>
>>106931013
>1girl, huge bazobaboobas, gargantuan titties, incredible masterpiece naked woman
based
>HAVING FUCKING SEX WITH A 1BOY
Cringe and cuckpilled
>>
retard
>>
>>106931020
don't self insert as the woman anon
>>
File: screenshot.1760809018.jpg (294 KB, 1366x860)
294 KB
294 KB JPG
the problem i have with ani is that it's very easy to get overwhelmed visually because there's no distinction between different menus. they all fuse together and look the same.

The floating windows are also pointless because you can't drag them outside the main application. I get it. you're a programmer not a UI designer, but you really need to get someone that understands UI design. you cannot possibly expect your average user to feel comfortable with this

I also suggest automatically collapsing the options in the T2I/I2I tabs. You want as little visual cluster as possible. Also, why does resizing the main app SCALE it? That's now how resizing windows should work.
>>
why did you name yourself trAni? actual retard
>>
>>106931074
>you're a programmer not a UI designer
hes neither of those things actually
>>
File: 2025-10-18_13-51-22.webm (3.41 MB, 1600x900)
3.41 MB
3.41 MB WEBM
>>106931091
well they got something working so i'll give them that.

also
>Also, why does resizing the main app SCALE it? That's now how resizing windows should work.
my bad on this. It updates correctly after resizing is done. Still, that threw me off and looks bad. It should update in real-time while resizing
>>
File: ComfyUI_00022_.mp4 (575 KB, 720x1280)
575 KB
575 KB MP4
>>106930828
>>
yeah just use forge if you don't want to deal with spaghetti, the best interface is just the a1111 version everyone accepts this
>>
>>106931185
EWWWWWWWWWWWWWWWWWWWWWWWWW
>>
>>106931074
>Also, why does resizing the main app SCALE it?
do you really have to ask? he's a fucking nocoder retard who cobbled together a shitty gui using a dev framework, do you think he knows how element scaling and DPI works? LMAO
>>
>everyone accepts this
>>
>>106930053
I have never seen anyone share a working workflow of rifleXrope using wan 2.2 in either i2v or t2v

Not even a screenshot. If you could share anything that would be greatly appreciated
>>
File: file.png (1.36 MB, 896x1184)
1.36 MB
1.36 MB PNG
>>106931212
theres 1 for 2.1, just copy that part. are you literally unable to connect 2 extra spaghettis?
fucking retards in this board I swear
>>
So did she secure funding yet or
>>
>>106931239
no, the japan trip was complete cope, probably dreamed during on of his self-asphyxation sessions
>>
File: rif.png (148 KB, 598x957)
148 KB
148 KB PNG
>>106931212
I tested it and noticed no difference between having it on and off at 129 frames (7 seconds). Once it gets to 101 frames, all frames after want to return to that initial input frame. Sometimes you get lucky on a seed and it feels more natural, but I don't think that rifle is doing that, because that happens randomly with no rifle enabled as well.
>>
>>106931212
Because it's a singular node that you connect to the ksampler. Now, I don't know if I'm doing it right, but I only connect it to the high noise model. I have no idea if you should connect it to the low noise too.

>>106931250
Have you tried playing with the k value? I guess maybe it doesn't work in wan22 after all and I just got lucky with some gens.
>>
>>106931285
>Have you tried playing with the k value?
Not yet. Hun used 4, 2.1 used 6, worth trying it in increments of 2 on the same seed I guess.
>>
ani here to answer some of these concerns:

>>106931074
>it's very easy to get overwhelmed visually because there's no distinction between different menus. they all fuse together and look the same.
yeah, until there is some options like fonts, icons and workspace tabs it's going to be a bit samey.

>The floating windows are also pointless because you can't drag them outside the main application.
enable viewports in the imgui render settings. if I could make them autodock I would have already had it in if viewports are off. I am assuming folks will just dock it naturally so it's not really a big deal

>I also suggest automatically collapsing the options in the T2I/I2I tabs.
this is something I'd like to have when I have nodegraph templates and you can just switch between presets

>>106931138
probably an imgui flag for this but it's not really that important. sorry it bothers you
>>
>seems like lot of folks itt are butthurt and riled up
>>
>julien
>>
>>106931372
>ani here to answer some of these concerns:
you forgot to address >>106931086
>>
>>106931431
ani and the notorious deebster have that effect
>>
>trani shilling his half assed wrapper that took him 3 years again
>>
>>106931463
Good things come to those who wait
>>
>tranny schizo giga-melty when someone more talented than them posts updates
every time. get a fucking job you loser
>>
>>106931488
i'm still waiting on the blessed ani shota collection some faggot said he was going to drop and still hasn't
>>
>>106931495
Ran is like that. He wants to control everything and everyone. Quite amusing really.
>>
>ranfaggot has a melty again because ani delivered
everytime lmao
>>
>girl who a year ago told me my "pro-ai" points were "basic" and "not new" now posting shit like the miyazaki "anti ai" video
I hate women so much it's unreal.
>>
>>106931488
how long do we have to wait before you kys? up that hrt dosage already sis
>>
I actually like that ani has answers to weird decisions about his project. comfy doesn't have any rhyme or reason to the retarded shit he does or refuses to do
>>
>>106931633
>comfy doesn't have any rhyme or reason to the retarded shit he does or refuses to do
how do you know
did you try asking
comfy doesnt lurk these threads 24/7 like ani does. now that comfyui has a reputation with proper funding and a growing userbase, he will most likely post less or not answer every little question.
>>
>>106931633
kys trani, stop replying to yourself
>>
>>106931680
every time he was here people would ask and he would conveniently disappear every time
>>
why is anon so obssessed with trannies?
>>
>>106931702
ask what? I've asked him questions and he answered them. obviously he's going to ignore bait from ani and the comfy schizo
>>
what are some good 1girl ethnicities?
>>
>>106931711
why is feature x so retarded? why isn't there feature y? it's a long list of dumb shit with 0 answers. honestly, fuck comfyui and we finally have something else to prop up now
>>
>>106931728
stop shitting up the thread with your useless software, retard
>>
>>106931752
I didn't make anything, I am just a retard anon, same as you. maybe a lot more schizo than me but the point still stands
>>
>he's disassociating
>>
File: gen_00035_.png (395 KB, 1024x1024)
395 KB
395 KB PNG
>>
>>106931728
>we finally have something else to prop up now
no we don't. doesn't matter how better you think your project is, no one is touching that UI that looks like it came from windows 98.

it's very, very ugly. you to hire a UX designer.
>>
File: gen_00028_.png (239 KB, 1024x1024)
239 KB
239 KB PNG
>>
>>106931372
Your name is julien
>>
>>106931782
im unable to process the level of kino in this gen
>>
>select "slowest" webp save method for overnight gens
>adds 19 minutes to a 20-minute gen time
>wake up to half as many videos
Oops...
>>
>>106931782
I remember in SDXL and earlier it was impossible to produce a gen that was just pure black, no matter how clever your prompting. When SD3 came out and was able to do it I was amazed
>>
>>106931809
maybe done save it to a system hdd?
>>
File: gen_00025_.png (516 KB, 1024x1024)
516 KB
516 KB PNG
>>106931802
check this out
>>
This blessed thread of frenship is talked about all over the chanz. The most popular and defacto ai generation thread that's for sure.
>>
keep having these melties anon. I'm sure everyone here wants to hear it
>>
>>106931074
just wait 4 years
>>
>>106931712
>>106931698
>>106931657
>>106931627
oh...yes, they love us..
>>
>>106931852
honestly this thread is mainly just a place to repost info from other places like reddit and xitter where the actual competent people are
>>
classic seething
>>
>>106931862
There are no competent people in AI
>>
>>106931828
How much longer until your shift ends?
>>
>>106931702
Comfy always answered questions when asked you drunk pedo
Now go in the containment thread to shill your garbage and spread lies
>>
>>106931866
this
>>
>>106931852
>/sdg/
didn't read
>>
>honestly this thread is mainly just a place to repost info from other places like reddit and xitter where the actual competent people are
Proof?
>>
>>106931873
prove it retard
>>
>>106931862
>where the actual competent people are
/r/comfyui is full of literal retards asking for help for the most basic shit
the discord is no different.

please post examples of these 'competent' people, because i'm not seeing it.
>>
File deleted.
>>106931809
Why not just h.264@CRF1?
>>
why would anon post here if he does not enjoy it? simply close the thread, anonie.
>>
>>106931893
the advice they get is pretty retarded too kek
>>
>>106931862
I feel the same way. I don't have any phobia of reddit or discord, but everything posted here is old news, the real news comes from there, Also, on Discord you get almost instant help from the devs themselves.
>>
>>106931900
I like that this one is viewable in my photo viewer but I'm sure you're right. Is there a node for that by default?
>>
look at these smart people
>>
>106931852
>sdg
>>
>>106931907
Why stay here and cry then? Off you go
>>
>>106931902
>why would anon post here if he does not enjoy it?
who said i don't enjoy shitposting here? just stating facts and the mention of reddit is p funny in itself
>>
>>106931913
>sentient cumfyui
>>
>>106931913
At least they help each other. Even to this day, I still don't have my WAN 2.2 txt2img 1 frame ready to import workflow, and I've been asking for it for half a year now.
>>
>open /sdg/
>n*gbo seething again because the blessed thread exists
Ah so that's where the sudden shitposting comes from
>>
>>106931943
if anything I think people are just done with comfyui which is why there is genuine interest in anistudio
>>
what do I have to prompt to get the hair frame a girls face, i.e. covering the sides of her face, maybe I'm retarded but "hair framing her face" never works for me, or do I need a lora for that
>>
>>106931907
remember when an anon here, on /ldg/, posted the fast cancel for comfy and some little redditor reposted it, and then it was officially implemented by comfy
>>
File: 1740962259794891.gif (3.71 MB, 420x420)
3.71 MB
3.71 MB GIF
>>106931952
>>
>>106931591
>>
File: image_00100_.jpg (670 KB, 1408x2064)
670 KB
670 KB JPG
>>106931943
>I've been asking for it for half a year now.
You had half a year to make it yourself
>>
>>106931954
No, can you link it?
>>
>>106931954
That's literally one case out of thousands. And you don't even know if that anon was a Redditor anyway.
>>
>>106931960
this is my choice of general for shitposting when i want a quick laugh so he's not far off
>>
>>106931943
>WAN 2.2 txt2img 1 frame
literally the same as video but set the frame to 1 retard
>>
did someone piss in d*bos breakfast or why is he multithread seething again?
>>
The ONLY useful thing about Reddit is if a dev is advertising their new custom node. 99.9% of the other posts are garbage.
>>
Spark update: training QIE with two control and an image at 768px is 30/s it. Probably faster if you do a qlora @ 4-bit. But for now just doing 8-bit with 4-bit te quant. Going to try tuning an LLM, Unsloth is optimized for 4-bit so I'll see how that goes, would be nice to make good uncensored Gemma model that isn't just abliterated.
>>
File: RaMu TwIrL.webm (3.92 MB, 852x1280)
3.92 MB
3.92 MB WEBM
>>106931910
Video Combine has several options for output, check that out.
>>
>>106932028
what's useful about another shitty custom node made by a junior and shits up your venv?
>>
imagine spending all day gen'ing basic instagram bullshit of asian models.
>>
is going from wan 5b to wan 14b worth a 1300 euros PC? how is the difference in motion?
>>
>>106932060
some custom nodes are wrappers for new technologies/models like seedvr, radial att, etc.
>>
>>106932073
Wan 5B is not even worth using, at all. The difference is like playing an N64 vs a PS5 game.
>>
>>106931907
>I don't have any phobia of reddit
i never understood it desu, it's just another website out of many to use to get info
>>
>>106932060
this. the entire comfyui ecosystem is a poisoned well of frivolous shit nodes, abandoned repos and holier than thou faggots
>>
>>106932089
and they are 99% broken garbage. what's the point?
>>
>>106932100
because the people who post there are retarded and the entire system has devolved into saying things that are popular and not correct
>>
File: never.jpg (98 KB, 588x815)
98 KB
98 KB JPG
Have you ever stole a prompt?
>>
>>106932115
no they aren't. if they were broken people wouldn't use them.
>>
File: 1760178990768235.jpg (33 KB, 615x378)
33 KB
33 KB JPG
>>106932124
I swear you AI-slop niggers are getting dumber every day.
>>
>anti comfy shill campaign again
I expect a new release (or more realistically an announcement) of JulienStudio soon
>>
>>
>>106932116
too much of a generalization, you'll have to be more specific on what you mean. the same thing could be said about any website including here
>>
>>106932128
we test them and they are shit. the ones you specifically picked are all fucking shit. vibe voice wrapper is also shit. I remember anon going through three different radial attention wrappers, a fucking pain in the ass to get working properly and it either didn't work or is a completely shit implementation
>>
>>106932139
actually no I won't contrarian faggot
>>
>>106932100
>it's just another website out of many to use to get info
normie cesspool of the lowest common denominator mouth breathers with heavy moderation turning it into virtue signaling hivemind. fucking disgusts me.

>it's just another website loool
go the FUCK back
>>
>>
>>106932148
i figured

>>106932154
>normie cesspool
na has the same degenerate faggots as here
>>
>mfw
>>
why are the arguments for comfyui just lies and ignorance? is it that important to just not realize everything will stay shit if we continue supporting it?
>>
idk about any schizos but i don't like comfy because it's fucking annoying to use
>>
>>106932244
based
>>
remember when comfy posted fennec girl with a bag of money after getting $17M in funding and ani was seething uncontrollably
>>
>>106932234
The argument that can't be refuted is why ComfyUI always using network connection after every gen?
>>
>>106932261
no I don't remember, I just find comfyUI fuckign annoying to use
>>
>>106932261
no. if anything it's the grift chink's money
>>
Updated ComfyUI and lost the gallery button, I enabled it in settings but it does not appear... is it over I have to use Comfy with a windows explorer tab to drag the gens in the workflow? All of that because I updated? I trusted him...
>>
>>106932261
Yes...in a local general...how healthy place this is...
>>
>surely if I post the same five arguments again, they'll change their minds
>>
>>106932265
it was over when API nodes hit. of course they would leech your data. the business side of comfy is pathetic. should have been non profit so none of this shit would have a reason to be in it
>>
Updated ConfyUI and a Log In button appeared in the right top corner....is this normal? Not even InvokeAI has this...
>>
File: gen_00042_.png (349 KB, 1024x1024)
349 KB
349 KB PNG
>>
>>106932321
because of it it sends data to Google on startup. you don't even need to login
>>
Anybody run comfyui on their dgx spark yet?
>>
>>106932367
why would I run comfy on my paperweight?
>>
holy fucking shit this fucking mentally ill trani retard shitting up the thread again FUCK
>>
>>106932367
its natively supported, you just install it
>>
>>106932287
new front end UI changes broke some minor things such as crystool's display, lora manager button and other things that should be fixed soon.
>>
>>106932306
>thinking they care about changing anyone's mind
>>
>>106932413
the new front end is just lipstick on the pig and this cycle of breaking shit for no reason isn't going to stop
>>
>>106932234
It took me a while to move to Comfy because of the whole node thing, but I have been here a while and really learned to love it. The past 3 or 4 months have just been a disaster as far as them trying to improve the functionality and ease of use.
>>
File: QwenEdit_00146_.png (939 KB, 784x1328)
939 KB
939 KB PNG
>>106929823
(You)
>>
>>106932435
it's called nightly for a reason. you dont use bleeding edge versions if you want things to be stable you absolute buffoon. by the time stable is updated with the new UI, the custom nodes will be fixed.
>>
Hi trying to run Comfyui on my pc Ryzen 5 3600 RX580 GPU 8GB I'm aware it is a old card and weak Is it possible to get it for AMD instead of NVIDA?
>>
>>106932455
>rx580
dude kys no cap, go to the vrmalet thread for poorfag cope support >>>/g/sdg
>>
Has anyone successfully created a seamless loop with only Wan 2.2 I2V without color burn over time?
>>
>>106932455
it's possible but im going to be honest with you, just don''t bother. either upgrade your pc or stick to online gens.
>>
File: 00025-4049374589.png (653 KB, 896x1152)
653 KB
653 KB PNG
>>
What is currently the best workflow to make a CGI render of a person look realistic?
>>
>>106932455
>8GB AMD
AMD is possible but this may be very painful.
You could probably do SDXL stuff slowly though.
One single qwen edit on a 16gb AMD takes me 3-15 minutes with 64GB system ram.
Search amd on comfyui, rocm, stablediffusion reddits ther are informative posts.
>>
>>106932500
just shove it in qwen edit and ask it to make it real DUH
>>
What version of comfyui is better to use, portable or desktop? I’ve used both and I prefer desktop, but I’m not sure if there is one reason over the other to use a specific one. I don’t have a ton of experience with it overall, but I would like to start learning it more and would like to set everything up correctly the first time.
>>
>>106932507
NTA, but won't you need NSFW loras since it's censored?
>>
>>106932521
desktop has more telemetry
>>
>>106932490
what is it
>>
>>106932522
use loras then?
>>
File: girlsubway.png (1.13 MB, 840x1240)
1.13 MB
1.13 MB PNG
>>106932522
depends.
I've had some success.
This is with only the lightning lora.
>>
What’s the best combo you guys are running with Python + PyTorch + SageAttention + Torch for faster video generation in ComfyUI?
>>
>>106932534
i'm asking if you need to.

>>106932537
how well does it handle something like a penis? I don't see any good loras for dicks yet. they look like shit
>>
>>106932551
>how well does it handle something like a penis?
haven't tried, assuming not well
>>
File: image_00003_.jpg (161 KB, 912x1336)
161 KB
161 KB JPG
>>
>>106932558
do Qwen loras work with Qwen edit? because edit doesn't even have a category on qwen
>>
>>106932533
Gradient Ascent.
>>
I previously used midjourney and it knows SO many styles and is very creative.
Does anyone know a model that is similar capable of knowing so many different artstyles? For digital illustrations.
I used sd, sdxl, flux, qwen and Wan. Best one yet was qwen. But it's still not even close to midjourney.
I am fairly new to comfyui so I am grateful for every point in the right direction
>>
>>106932568
just try them you fucking retard?
>>
>>106932568
I've heard mixed replies to this.
No harm in trying it, it may work.
>>
Is it possible to generate a video in ComfyUI based on a transcript or audio file length?
>>
>>106932580
I will when my 100+ queue of wan gens is done you fucking dumbass that's why i'm asking. you think I didn't think of that genius idea? fuck off

>>106932582
I see
>>
File: image_00006_.jpg (153 KB, 912x1336)
153 KB
153 KB JPG
>>
Any comfy ui model that is equally good as seedream 4?
>>
File: 1705956402513702.png (104 KB, 594x594)
104 KB
104 KB PNG
>comfy ui model
>>
>>106932586
WAN S2V does that but I haven't tried it.
>>
Is it possible to run WanVideo and all other video gen workflow on 16GB vram Video Card?
>>
Is it possible to put the penis inside my ass instead of my mouth
>>
File: image_00008_.jpg (207 KB, 912x1336)
207 KB
207 KB JPG
>>
>>106932619
Yes. Quants.(GGUF)
>>
>>106932618
S2V is shit because it still has the 5 second limit.
>>
File: get out.gif (807 KB, 480x262)
807 KB
807 KB GIF
>>106932602
>>
Is it possible to match the prompt adherence level of chatgpt/gemini/grok with a locally running model?
>>
File: image_00010_.jpg (301 KB, 912x1336)
301 KB
301 KB JPG
>>
i want babby
how do i gen babby
>>
so why is this schizo having yet another melty
>>
>>106932680
Find someone of the opposite sex.
>>
>>106932688
ran "organic" posting again, he'll tire out eventually
>>
File: 1760657675760323.mp4 (903 KB, 992x560)
903 KB
903 KB MP4
;)
>>
File: image_00015_.jpg (252 KB, 912x1336)
252 KB
252 KB JPG
>>
>>106932712
disappointed you just reposted. make another where shit is leaking out of the monitor
>>
File: 00004-116403112.png (2.27 MB, 1824x1248)
2.27 MB
2.27 MB PNG
>>106932455
please get a Nvidia gpu with higher vram. With 8gb of vram, you will have annoying issues with running normal fp16 non-lighting sdxl models when using hires fix and upscaling. make sure you have 32-64gb of either ddr4 or ddr5 ram anon.
>>
File: image_00017_.jpg (188 KB, 912x1336)
188 KB
188 KB JPG
>>
+1 post count so we get closer to a new thread with hopefully less aids in it
>>
File: radiance.png (3.28 MB, 864x1488)
3.28 MB
3.28 MB PNG
>>
>>
>>106932764
kek
>>
https://huggingface.co/lightx2v/Wan2.2-Distill-Models/tree/main

anyone try these? include lora apparently
>>
>>106932777
huh, Res Multistep Linear Quadratic (this gen) looks way better than Euler Beta (last one) on the same seed
>>
>Do some test gens in the NetaLumina model that's being shilled lately
>This result isn't so bad
>Wait, I can turn this NL prompt into Booru tags and render it in NoobAI
>NoobAI result looks better
All roads seem to eventually lead back to SDXL. Which is a shame because NL prompts would be nice.
>>
>>106929823
It stops being your cock after the first frame though.
>>
File: karmara.jpg (75 KB, 1024x1024)
75 KB
75 KB JPG
new character lora (35%, 35%, 30%) mix of three actresses, the features are strong, but i don't think any one of them is that apparent. next experiment is to generate using two women and a man for variety
>>
File: image_00026_.jpg (217 KB, 912x1336)
217 KB
217 KB JPG
>>
File: 1756153239373801.mp4 (1.69 MB, 912x720)
1.69 MB
1.69 MB MP4
>>106932138
>>
File: ComfyUI_temp_yujmj_00012_.jpg (496 KB, 1088x1408)
496 KB
496 KB JPG
>>106931953
>Photograph of a young Caucasian woman with long blonde hair, center parted hairstyle. The hair features face-framing pieces that gently sweep to the sides. Soft, natural waves add movement and volume.
>>
>>106932835
you can't in fact just like, say WHERE in the image shit should be on Noob though, which is what you're conveniently ignoring here. XL also has major bleed issues for multi-concept / multi-character scenes that aren't a problem on newer architectures.

Also were you actually using base Neta Lumina 1.0? If so don't, use the continuation finetune of it, NetaYume Lumina, from here:
https://civitai.com/models/1790792?modelVersionId=2122326
>>
>>106932956
woops I linked an older version, not intentionally, you want the latest one:
https://civitai.com/models/1790792?modelVersionId=2298660
>>
>>106932835
Try forcing SDXL to do two distinct subjects, i dare you. You will either cope with regional prompting or get massive detail bleed and incoherent interactions. Neta is not needed for 1girl slop, but it can do much more.
>>
File: 1730417928707666.webm (3.94 MB, 752x416)
3.94 MB
3.94 MB WEBM
Neta prompt + grok. Not sure if trying to get into local video gen with 12 gb vram is a good idea but grok is a decent free alternative.
>>
File: image_00028_.jpg (567 KB, 912x1336)
567 KB
567 KB JPG
>>
>>106932990
nice, Grok vidya gen is quite good for being free IMO. Wish the resolution was a bit higher, but the way it just kinda does its own thing at first often gives good results without you needing to re-run with a custom prompt, which is pretty convenient.
>>
>>106932956
I'm using NetaYume Lumina v3 which was released ~9 days ago according to Civitai. I'm aware of SDXL's limitations, believe me.
Maybe it's just this one particular prompt that it's having trouble with, but the problems seem to boil down to a lack of diversity in training data rather than the strengths of the algorithm itself.
>>
>>106932777
>>106932807
lora pls?
>>
>>106933015
yeah, the normies are learning about I2V with this model
https://xcancel.com/stillgray/status/1978741461551055030#m
>>
File: 1755679011935250.webm (3.91 MB, 464x688)
3.91 MB
3.91 MB WEBM
>>
File: 1741120699535878.mp4 (1.54 MB, 1040x720)
1.54 MB
1.54 MB MP4
>>106932760
>>
File: 1744236126570703.jpg (1.12 MB, 1536x1536)
1.12 MB
1.12 MB JPG
>>
>>106933081
>has more spaghetti on the plate after the splash
uhmmm consistency bros????
>>
File: 1737682547763790.png (968 KB, 1457x1546)
968 KB
968 KB PNG
https://xcancel.com/LodestoneRock/status/1979382717210071515#m
Ostris implemented ramtorch, has anyone tried it?
>>
File: jewfyui.mp4 (637 KB, 624x400)
637 KB
637 KB MP4
>>106932742
what about treason for chinese gold
>>
>>106933126
>treason for chinese gold
he didn't implement HunyuanSlop 3.0 though, he's a Nvdia shiller, not a chink shiller
>>
File: image_00033_.jpg (327 KB, 912x1336)
327 KB
327 KB JPG
>>
>>106933142
he is a crony for the Shanghai mafia
>>
File: 1748328432781160.webm (3.94 MB, 560x560)
3.94 MB
3.94 MB WEBM
I cannot even imagine what indie anime people are going to create within 5 years. Some dedicated autist will probably adapt Berserk better than any of the existing adaptations.
>>
File: dmmg_0032.png (1.32 MB, 896x1152)
1.32 MB
1.32 MB PNG
>>106933121
gonna try this out tomorrow, i got three more things to train today
>>
Neta's body proportions are all over the place too. I'm giving it a chance but I dunno, it has some issues.
>>
>>106933174
>Some dedicated autist will probably adapt Berserk better than any of the existing adaptations.
it'll get nuked because of copyrights lol
>>
>>106933142
>Nvdia shiller, not a chink shiller
chink jensen (nvidia), is literally chink su's (amd) cousin
>>
File: image_00038_.jpg (590 KB, 912x1336)
590 KB
590 KB JPG
>>
>>106933235
It's pretty impressive that we can generate animations of still images moving around in a spatially-coherent way at all, but video models have a long, long way to go.
>>
>>106933306
Nvdia and Amd are US companies though, China steals a lot of things, but they can't steal Nvdia, that's for burgermen
>>
>>106933235
what artists are you using (if any)? Some are certainly less stable than others.
>>
File: 1752973781951577.jpg (1.19 MB, 1248x1824)
1.19 MB
1.19 MB JPG
>>106933235
Body proportions heavily depend on your prompt and the art style. It does have an issue with melting body parts, i assume thanks to being undertrained.
>>
>>106933333
>>106933335
I'm not using any artist tags, any recommendations? I am but a humble 1girl gooner trying to generate sexy pictures of smug-looking bitches, which is another limitation I'm running into: either it can't understand facial expressions very well or it can't generate facial expressions that differ from how a given character is usually depicted.
>>
>>106933023
you must mean 3.5 if "released 9 days ago is true"
what is the prompt, anyways? and like what sampler / scheduler / etc are you using

no one is saying it's like perfect quite yet anyways but it's definitely annoying to see people dismiss the clear advantages of better architectures. That's how we wind up in this endless cycle of "when new thing" -> "new thing comes out" -> "not nearly enough people make any attempt to work on / with creating resources for it or training it more"
>>
no more shilling, chink or fagmerica. shut the fuck up
>>
File: 1741357174359393.png (1.02 MB, 1360x768)
1.02 MB
1.02 MB PNG
>>
>>106933323
>are US companies
he doesn't know
>>
localsissies, they're making fun of us :( >>>/wsg/6003112
>don't spit on veo. local is worse. on 2025 local, we still applaud for gens with basic poses... local is like ps3...
>>
>>106932324
is comfyUI actually compromised in that it sends metadata or actual prompts to external servers?
>>
Friends, several months ago while lurking I stumbled upon a very powerful yet paywalled frontend. Their cheapest tier allowed using their software for $5 (probably without subscription?) but higher tiers allowed something more powerful, don't remember what. It allowed for voice/video gen, some vr features. Actually I think I found it in the archives, in some vr related thread on 4chan.
Name probably starts with v.
Does anyone know what I'm talking about? I'm just curious about it's state.
>>
>>106933024
I trained it myself, it's not on any site. Here:
https://mega.nz/file/Q2ITwYAR#86GEr-fyEardi0ykwQhgKqCtnIOdWarfe5Tvnwx3f1M

Run it around 0.8 - 0.9 strength. You'll want to use a typical Chroma schizo negative along the lines of e.g. `aesthetic 0, ugly, blurry, pixelated, jpeg artifacts, out of focus, unfinished, lowres, depth of field, bokeh, sketch, greyscale, monochrome, traditional media, illustration, 2d, 3d, cgi, render, cartoon, anime, painting, painterly, drawing, watercolor, disfigured, deformed, fused, conjoined, bad anatomy, bad proportions, bad composition, bad perspective` or it won't look good (not an issue with the Lora, Chroma just isn't realistic at all without that kind of negative).

Was captioned with jailbroken Gemini 2.5 pro natural language description using her name in the sentences where it made sense, so prompting it that kinda way works best.

Good baseline sampler setups are Euler Beta or Euler Ancestral Beta @ CFG 5.
>>
>>106933456
if it acually was someone would immediately make a ragepost on the stable diffusion reddit with hard evidence, I feel like
>>
>>106933453
>#9
damn i have some vids to catch up on
>>
>>106933481
(samefag) oh yeah I should have noted too, as far as which version it was trained on, it was the "official" final release of HD, like the same one that's one the CivitAI page here:
https://civitai.com/models/1330309/chroma
>>
File: truck.png (3.21 MB, 1696x1296)
3.21 MB
3.21 MB PNG
>>
File: 00043-3742879360.png (143 KB, 896x1152)
143 KB
143 KB PNG
>>
>we
>>
>>106933530
>wuz
>>
>>106933353
Just good old trusty Euler Ancestral. Anything else tends to result in even more fingore than normal.
I'm not dismissing the model, it looks promising, but it needs more training. I guess. I'm still new.
>>
this is probably a dumb question but when you're doing img2vid do you want the source image to be as high resolution as possible or do you want it close to or the same as the video resolution?
>>
>>106933348
https://neta-lumina-style.tz03.xyz/
>>
>>106933646
Bookmarked, ty
>>
>>106933456
>actually believing the ldg schizo lies
ngmi
>>
>>106933553
with what Scheduler? anyways give DPM++ 2S Ancestral @ CFG 4.5 a shot, using Linear Quadratic as the scheduler. I've mostly found that to be the best performing overall. Res Multistep @ CFG 5 again with Linear Quadratic is also worth a shot.

In my experience Euler anything is almost always WORSE when it comes to Neta.
>>
>>106933662
Oh sorry, simple scheduler. Alright I'll give that a shot, thanks. I tried res_multistep with linear_quadratic as suggested on civitai and it gave me a lot of mangled hands.
>>
>>106933688
knowing what this specific prompt you're struggling with actually was would help also honestly lol
>>
>>106933700
also yeah I don't think the Simple scheduler would ever be a good choice regardless of sampler for Neta, even Beta is almost certainly gonna give better results. Or Normal or SGM Uniform. Simple is useless really on most models IMHO, it's "recommended" by base model creators in the same way DDIM was "technically" the default for SD 1.5 and such.
>>
>>106933453
>>>/wsg/6002417
i need more dumb shit like this lmao
>>
>>106933453
>>106933719
>API can do violence and we can't
>>>/wsg/6003183
are we sure the localkek community is the "freedom" community? doesn't look like it
>>
Cant bake, someone else
>>
>>106933481
>>106933511
thanks, anon
I'll test later
>>
File: ComfyUI_150708_.webm (980 KB, 832x480)
980 KB
980 KB WEBM
>>106933819
to be fair wan2.2 can do violence but the quality is p shit would be better with a lora. the wan kick one was one of my favs.
here's an old one off gen no loras of 2.2, prob could get some decent stuff if you tried. though you're not going to get it if you're using the low step loras and 1 cfg. was the same with 2.1 too.
>>
File: 1747188227860553.png (497 KB, 1206x1599)
497 KB
497 KB PNG
>1.58bit models will be the future
GROK IS THIS TRUE?
https://xcancel.com/_akhaliq/status/1979209909444001822#m
>>
File: 1673230897700.jpg (97 KB, 1920x1440)
97 KB
97 KB JPG
>>106934062
>>
>>106933456
https://github.com/comfyanonymous/ComfyUI/blob/9da397ea2f271080406f0c14cf4f0db7221ddf70/comfy_api_nodes/apis/client.py#L297
>>
>>106934228
that's just code to check internet connection bro, are you dumb?
>>
>>106934228
it just checks if your internet can open up google, if it does, the code considers you have internet on, that's all
>>
>>106934238
if there is a connection, it sends data to Google you stupid fuck. enjoy local being a second class citizen in "the best" local ui. what a fucking joke
>>
>>106934245
>it just checks if your internet can open up google
why?
>>
>>106934254
to see if you have internet, if you don't, the code won't bother doing API shit, it's a waste of time
>>
>>106934247
>if there is a connection, it sends data to Google you stupid fuck.
it sends nothing retard, look at the code
>async with session.get("https://www.google.com", ssl=self.verify_ssl)
it just opens up google, that's all, it's not sending anything
>>
>>106934254
so it knows what to do if you try to use any of the many valid nodes with internet connectivity? Why is this so difficult to understand lol? Even prior to the API nodes stuff there's ALWAYS been Comfy nodes that use your connection to pull down needed models directly from HuggingFace when necessary for the sake of convenience.
>>
>>106934298
>it sends nothing retard,
wireshark says it sends data to Google and I trust that over you
>>
>>106934300
why is it needed if I disabled API nodes? why doesn't it check for a connection when I actually try to log in instead of startup?
>>
>>106934303
post pic that validates you saying what WireShark says is ACTUALLY what you claim it is saying
>>
>>106934313
I do every time and you stupid fucks conveniently forget every fucking time. how about you post a fucking screenshot of wireshark detecting it instead of asking me. FUCK YOU
>>
File: 1696378387900808.gif (3.53 MB, 605x600)
3.53 MB
3.53 MB GIF
I got ComfyUI and Wan 2.2 going but my container crashes on this last step using the ti2v workflow. Is it not enough to have 16GB of RAM available? I have a 4080 Super, this shouldn't crash yeah? I have CUDA and everything installed in the container.

>got prompt
>Using pytorch attention in VAE
>Using pytorch attention in VAE
>VAE load device: cuda:0, offload device: cpu, dtype: torch.bfloat16
>Requested to load Wan
>VAE loaded completely 8961.125 >1344.0869674682617 True
>Requested to load WanTE
>Model loaded completely 9.5367431640625e+25 10835.4765625 True
>CLIP/text encoder model load device: cuda:0, offload device: cpu, current: cuda:0, dtype: torch.float16
>model_type FLOW
>Requested to load WAN22
crash. I see my VRAM is at 95% when it crashes but I thought 16GB was enough to run the 5B? I'm just playing around with the Wan sample workflows and just got ComfyUI manager installed.
>>
>>106934312
IDK, probably just for the sake of caching availability as a value or something. But again it's still not really anything new, there's NUMEROUS perfectly valid community nodes that would just YOLO try to download shit from hugging face if needed without actually even validating whether the internet was available, already, and presumably just hard fail if it wasn't.
>>
>>106934312
this. it's pretty predatory behavior
>>
>>106934303
>the code has only "session.get"
>to send anything it has to be "session.post"
can't believe I have to explain something this basic to retards like you but here we are
>>
>>106934324
i'm here like every day and i've never one time seen anything remotely like that posted
>>
>>106934328
you got in at the wrong time
comfyui is completely shit now
>>
>>106934340
ah fuck, good thing I didn't start learning it. What's everyone else using with Wan models?
>>
is there an official guide to qwen image edit?

also, does gguf work better at using less memory, with qwen image edit?
>>
>>106934336
why do I have to explain wireshark detecting data being sent to Google on startup every time? you are a shill doing damage control so they can keep selling data. fuck you
>>
>>106934350
Yes I know there's a wf gallery thing. I don't want to get an inferior version.
>>
>>106934349
just use wan2gp. it's retard proof
>>
>>106934353
just copy paste the code on chatgpt and he'll tell you it doesn't send anything, I don't know anything about your wireshark meme, I know how code works you retarded fucker
>>
>>106934364
just install and run wireshark so you can see you are retarded
>>
>>106934373
>just trust wireshark bro, it is the ultimate minister of truth
or... you could read the code (if you can read) or ask chatgpt to read the code and tell you if there's something that sends anything (and there's none)
>>
>>106934379
what the code does is more important. if it is sending out data to a third party that is the problem. I honestly don't give a shit what you think, I won't stop letting people know comfyui is compromised by glowies and chinks
>>
File: 1755734599938199.mp4 (945 KB, 640x640)
945 KB
945 KB MP4
https://huggingface.co/lightx2v/Wan2.2-Distill-Models/tree/main

try high/low has the updated 2.2 loras embedded, 109 seconds with 4 steps.
>>
>>106934411
>if it is sending out data to a third party that is the problem.
but it doesn't, do you know how to read code or something? you're just claiming something and you have no proof of anything, are you fucking retarded? it says GET not POST you fucking nigger
>>
File: 1735174600296757.mp4 (1.07 MB, 640x640)
1.07 MB
1.07 MB MP4
>>106934417
116s

pretty gooddsdsd
>>
>>106934417
I don't get it, the full model is just wan 2.2 with the lora added right? (the lora that was released a few days ago)
>>
>>106934438
>captcha not required
thx for not working tab btw like im having a seizure
>>
>>106934353
you are a paranoid schizo trying to push fairly obvious lies that don't make any sense without even using vaguely believable evidence as a prop, fuck you.
>>
>>106934443
it has the loras in it and doesn't have the weird hallucinating effect that the lora itself had when used with wan 2.2 high/low

also, seems pretty fast
>>
>>106934458
I'm not downloading the full model, the lora should work by itself, like it did on their previous loras, they fucked up something, and they have to fix it
>>
File: 1751836634448798.mp4 (1.09 MB, 640x640)
1.09 MB
1.09 MB MP4
>>106934458
with rife vfi interpolation, 112 seconds (less without interpolation, film vfi is a bit slower). pretty good, still want 2.5 though but this is very capable.
>>106934468
well it is unfucked now, plus gen times are faster with this distill version. worth a try.

using the high/low that end in comfyui not sure if the others are different at all.
>>
File: r.png (3.33 MB, 864x1488)
3.33 MB
3.33 MB PNG
>>106934328
I do think 16GB is generally enough to run the 5B but I really don't know how much you'll use exactly when in the container. maybe you get a RAM usage spike when the VAE converts the latent space or something.
>>
if anyone wants pretty good Qwen Loras for:
- Marjorie Taylor Greene
- Alina Habba
- Lauren Boebert
- Jasmine Crocket
- Laura Loomer
- Rubi Rose ("not like the others" thematically yes lol, I just haven't finished other normal celeb ones yet)

I trained / have all of these, but there's not really a good site where you're allowed to post them anymore. I can mega.io any of them if anyone wants though.
>>
>>106934438
It's that guy blue hair Mike
>>
>>106934491
i'm interested in the choice of training settings, do you have an example?
>>
>>106934422
>>106934455
I trust wireshark over you. shut up or snap a screenshot of it not sending data on startup. otherwise fuck off
>>
>>106934508
Mike Hatsuno
>>
handsome bloke
>>
>>106934518
>I trust wireshark over my own eyes (aka reading the code by myself)
buy an ad
>>
>>106934488
I think I do. I tried lowering samples and such but I can't squeeze a proper gen ;_;
>>
>>106934488
I absolutely failed at getting 16gb to work. I succeeded (it wasn't nice) with 32gb. 64 is fine.
>>
File: 1735344622685006.mp4 (953 KB, 704x480)
953 KB
953 KB MP4
https://huggingface.co/lightx2v/Wan2.2-Distill-Models/tree/main

using high/low, 4 steps, no loras (it's in the model/merged)
>>
File: QwenSettings.png (32 KB, 1273x576)
32 KB
32 KB PNG
>>106934513
i've been using TensorArt just to get stuff done around a kravillion times faster than I would otherwise. This stitched-together pic basically shows the gist of the settings I always use.

I think the one thing not seen there is that the number of Restarts was set to 3. And also I don't personally ever use repeats, I just set it to 1 and increase epochs to whatever, I find this gives more predictable results.

Also batch size displays as "nothing" there but it's technically set to 1 behind the scene on TA for Qwen, basically I guess they just use Gradient Accumulation Steps in lieu of batch size on certain models in TA's trainer in order to let you get nearly the same thing as actually increasing batch size would, but with way less risk of their training systems OOMing during the run.

So like that is to say, theoretically if you WERE acctually using batch size normally when training Qwen in whatever other trainer, you'd have IT at 4 but Gradient Accumulation steps at 1 or nothing, so like basically the reverse of what's in the pic. Hope that makes sense.
>>
>>106934580
>slow mo
they'll never be able to fix this won't they?
>>
File: 1740492399720269.mp4 (1.05 MB, 704x480)
1.05 MB
1.05 MB MP4
>>106934580
>>
>>106934597
livleak archive lora when
>>
>>106934583
interesting. i'll see if i can use this in another trainer. so about how many epochs (image count =?) or steps does it take with these settings? the lr is lower than the one I attempted and it didn't really converge in 3000 or so
>>
>>106934554
So I should upgrade my RAM? I don't see my memory hitting 100% but I am using a custom 64gb page size. To me it looks like it's VRAM, I guess I can't run local diffusion of these models on my 16GB. I see the model loads fine but the workflow crashes the container at KSampler, sometimes it gets past and crashes at VAE Decode and I get told it's hitting RAM limit switching to Tiled VAE Decode.
>>
>>106934630
Maybe not, but I literally pulled in 16gb from another machine (so I had 32gb) and boom it worked. So I got 64 lol. And others point out I should probably max mine to the 128 (but for LLM).
>>
File: 1744471352833207.mp4 (916 KB, 704x480)
916 KB
916 KB MP4
lmao the chat is scrolling even

I like the wan 2.2 distill update. 98 seconds for 4 steps (no loras, it's baked in or whatever), that's pretty fast.
>>
>>106934530
it's obfuscated. you can't tell me what it's sending but I can tell you it's sending data. maybe go fuck yourself and learn op sec
>>
>>106934630
maybe you can make it work with better offloading/ram clearing/tiled decode from the start/gguf quants

but also sure, more VRAM, RAM and everything is good. we all pretty much don't have too much of that.
>>
File: 1751869657783533.png (227 KB, 500x378)
227 KB
227 KB PNG
>>106934743
>I can tell you it's sending data
>>
>>106934732
>it knows
>>
hmm...the lora combo may be better

high: 2.2 kijai MoE lora 1 str + 2.1 lightx2v 3 str

low: wan 2.2 lightning low 1 str, 2.1 lightx2v 0.25 str
>>
File: 1750899477582988.mp4 (837 KB, 480x704)
837 KB
837 KB MP4
>>106934810
see, more motion.
>>
NEW THREAD
>>106934820
NEW THREAD
>>106934820
NEW THREAD
>>106934820
>>
>>106934756
I guess I thought I was set with 16GB vRAM and 32GB RAM. I can easily upgrade the latter but if the issue is not enough vRAM from my GPU I'm kinda cooked. That's a new PSU for a second GPU or new GPU to begin with. I didn't realize my rig couldn't handle this. Are there other more memory friendly software compared to ComfyUI?
>>
>>106934891
no, not really. comfyui might even be the one where you can do most with custom nodes and some experimentation right now.
>>
>>106934891
* i do think you can make most things work on your specs BTW

but you are going to use some tiled vae here, unload stuff like a text encoder or unet at times, and maybe balance some system ram vs vram on comfui-multigpu or something. it won't always justwerk without anything.
>>
Someone make a proper thread wtf
>>
>>106934919
No that makes sense, I've actually already began using tiled vae and I'm moving now into implementing cpu offloading for text etc. Thanks anon.
>>
>>106934932
sure. i think it'll work when you do at least some of this manual management with custom nodes. if you know where specifically you hit OOM, some people might know what they use there specifically. we nearly all don't have quite enough VRAM at least. good luck.
>>
File: uh oh.webm (456 KB, 320x256)
456 KB
456 KB WEBM
>>106934957
It worked, using cpu for the CLIP and tiled VAE decode but.. holy shit I've created an abomination. I wanted her to open her arms and this is what I got...
>>
>>106935027
trippy
>>
>>106934629
20 - 40 epochs is like "deece" with 1 repeat. It kinda slow burn improves almost indefinitely with these settings though, so as many as you want generally will be better, there's not really a point where it starts to massively and clearly degrade as on a distiled model like Flux



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.