/g/ - Technology




Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106691532

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
But how many watermelons can she physically carry?
>>
File: AnimateDiff_00001.mp4 (1.56 MB, 704x704)
Look at that sucking fidelity of the sausage and bread, god damn.
>>
>>106696190
>>106696196
illustrious is all you need for 1girl animeslop, stop acting like you need cutting edge tech to do it, you subhumans.
>>
>>106696247
>and Neta Lumina 1.0 was always quite objectively better than stock Illustrious 0.1
hey i like neta as much as the next anon but it was never better than il kek
>>
>>106696295
But can Illustrious draw anime girls watering a potted plant with a watering can?
>>
File: qwenpls.jpg (1.54 MB, 3243x2716)
Is qwen image edit really as good as people make it to be?
like could you transfer the pose and clothing of the character on left to the one on the right?
I tried to fuck around with img2img and controlnet but the results were kinda meh.
>>
File: AnimateDiff_00001.mp4 (972 KB, 480x544)
I've managed to glitch out this workflow, I'll make use of it.
>>
Why bother continuing the thread? Local is over.
>>
chroma bros, how do you get controlnet to work with chroma?

>>106696288
fucking kek
>>
Blessed thread of frenship
>>
File: 1733134956643650.png (1.89 MB, 1344x1728)
gen more pepes

>>106696295
>>106696305
neta yume simply lacks fully-baked art styles. artist tags that are standard and work great in illus or noob are fucked up in neta. it needs significantly more training. the prompt comprehension is obviously superior though.
>>
File: 1737758548794681.png (2.04 MB, 1024x1552)
>>106696295
it's about convenience, not having to run detailer + upscaler, being able to use nlp along with tags, I mean the only advantage illu has are all the char loras and cnet support.
>>
>>106696319
what the FUCK did i just get kek

prompt was
>1girl, potted plant, watering can, holding, watering plant, outdoors
>>
File: AnimateDiff_00001.mp4 (1.35 MB, 704x704)
>>106696336
Surprisingly high quality once done properly. They must have trained on a lot of cat videos.
>>
>>106696396
this looks like fucking shit though, you have low standards
>>
>>106696414
yeah this one came out like garbage
>>
>>106696400
why are you genning cats that look like shaved ballsacks?
>>
>>106696429
i know im coming off as rude but these really dont look good. maybe after a few finetunes and loras. the issue (for you), is that almost no one is going to bother switching from illustrious/noob or even bother using something other than wainsfw
>>
File: WAN2.2_00025.mp4 (3.87 MB, 616x840)
>>
File: WAN2.2_00026.mp4 (3.53 MB, 632x816)
>>106696445
>>
File: sneed.jpg (1.63 MB, 2048x2048)
>>106696367
Lurker, dropping to say hi. Anon, I know we're in a shitty phase, but try not to get stuck in the tarpit of retards that are so eager to ruin the thread. The tech is so cool, isn't it almost a time machine? Even if most only use it to stroke their dicks to masterpiece, 1girl, standing. I think people will see the light once we get better coherence and prompt comprehension. To that deranged anon who always gets banned, thanks for namedropping Thucydides, I should spend more time reading books instead of shitposts. Stay classy, lmg!
>>
File: AnimateDiff_00001.mp4 (2.33 MB, 704x896)
"a beautiful woman spots the camera and her expression changes to disgust, looking at the camera with disgust."

Geez, the resemblance changes so fast, no bueno.

>>106696430
Going through folders with images I've saved throughout the years, testing the overall quality of my workflow.
>>
File: WANI2V__00187.mp4 (1.85 MB, 1320x696)
>>106696337
Why bother breathing? No one likes you.
>>
>>106696492
is that a reference to kiernan looking at the camera and grimacing in disgust?
>>
File: 1736858097716101.png (123 KB, 1156x766)
Anyone else having lora issues when using lightning loras?
I get picrel every time for some reason.
I use these :
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Wan22-Lightning
>>
does anyone here binge watch ai videos on youtube?
https://www.youtube.com/watch?v=y-zENxLijmw
https://youtu.be/u-DFAq51mlI
https://youtu.be/e5mvqpvErhs
https://youtu.be/WMW80i4Nr1g
https://youtu.be/Nq7Xt1JVfGU
https://youtu.be/G5vO8zK9Roc
https://youtu.be/V4zwIhS2iZk
https://youtu.be/RGbkTZPe4Sk
https://youtu.be/YH3Tgm9EmQk
https://youtu.be/Grw6ytFdGkk?list=RDMM
>>
File: file.jpg (27 KB, 478x323)
any recommendation for parameters to use with NAG?
>>
>>106696532
i can honestly say i would rather kill myself than ever watch any of that slop. the thumbnails alone make me rage. for anyone consuming this rot: kys right now to death until you die.
don't give these tardfags revenue.
>>
File: WAN2.2_00029.mp4 (3.87 MB, 632x816)
>>106696492
same prompt
>>
>>106696532
ew no
>>
>>106696532
how can local compete?
>>
the api fag is awake. time to evacuate the thread. see you retards in 10hours
>>
File: AnimateDiff_00001.mp4 (2.43 MB, 704x1088)
"a beautiful womans eyes turn fully black and black veins spread from the eyes all over the face as he mouth opens wide and her expression turns neutral."

Not impressed.

>>106696517
Man of good taste.

>>106696557
Yeah it's not too good at proper disgusted expression.
>>
File: WAN2.2_00031.mp4 (2.42 MB, 632x816)
>>106696593
bs prompt adherence
>>
>>106696528
From kijai himself: https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/909
>>
>>106696544
i don't get it, you want to generate ai content(images/video) but you never intend to watch any ai videos and consider them slop regardless of effort and quality that goes into them?
>>106696571
i suspect some of these videos have some aspect of local foss model usage involved. It's good to study how other people outside diffusion threads are using this shit. I rarely if ever see any decent stitched together video of ai generated clips made strictly from FOSS models. People want to talk a big game of advocating and supporting foss models but never post their shit outside this echo chamber for the others to see.
>>
>check previous bread
>ani third person shilling like a madman
I can respect the hustle
>>
>>106696664
>and consider them slop regardless
absolutely. the fact you watch these tells me that you are blind.
but you're just posting them here to shill them so double fuck you.
>>
>>106696686
His delusion makes it sad, who would invest in his project in its current state?
I'm trying to figure out the value proposition when both comfy and voldy had high metrics before that even was a discussion?
I think he's going to Japan because he's lost and wants something to do but is cope lying like he always does
>>
is a8r8 still required for regional prompting or are native nodes good enough in comfy?
>>
desu my favorite posts are the ones that go "well ive never seen it so it must not exist"
>>
>>106696659
Thank you anon.
>>
>>106696664
i agree with you for the most part but it's the same with local slop in that you have to shovel through the shit to find the good stuff. this place just heavily leans towards gooners so you're not going to see much outside that here.
there's plenty other communities though so it's not a bad thing.
>>
for wan 2.2 without light loras, how many steps should i do? 20 high and 20 low?
>>
i bet you this is spyware, there is zero reason to ever use this https://github.com/PastLifeDreamer/Pocket-Comfy
>>
File: 1728248622992062.png (3.03 MB, 1328x1552)
>he boughted?
>>
>>106696694
i wasn't really shilling those videos in particular. I just wanted to show off other shit i found interesting that wasn't exclusively gooning fap material and meme gens.
https://youtu.be/0y1xGZ5LRns
>>
>>106697023
I love the way lumina handles both lineart and brush strokes. I was under impression yume burned that out of it in favour of slop, glad to see it didn't.
>>
>>106697072
problem is the fucking fishnet, which needs a detailing pass (which im not gonna do because fuck that noise desu)
>>
>>106697059
cool video but horrible editing and music choice
>>
>>106696328
>Is qwen image edit really as good as people make it to be?
no, it's ass lol
>>
https://youtu.be/_2pmL1Kpsqw
https://youtu.be/UsYO1deUwN4
https://youtu.be/eoNWXB_Y6L8
https://youtu.be/Vb80WK3GQZc
https://youtu.be/nFdmO-Ju2YI
https://youtu.be/FOQy4LSFLhg
https://youtu.be/kVilXt_dO_U
https://youtu.be/P1puNkGgUS4
>>
> adds youtube to the filter

fuck you
>>
Qwen Image Edit excels at editing only the images that were rejected by Gemini. I also like that it can be easily integrated into workflows in the future.
>>
>>106696988
15H/20L usually to get a good result, but it depends on the sampler
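For anyone wiring that up by hand, here is a rough sketch of how a 15 high / 20 low split could map onto two chained sampler passes. It assumes the counts are the steps each model actually runs, and that both passes share one total step count with start/end step bounds (the way ComfyUI's KSamplerAdvanced exposes `start_at_step` / `end_at_step`). The function name and dict layout are made up for illustration.

```python
def split_steps(high_steps: int, low_steps: int) -> dict:
    """Return step ranges for a two-pass high-noise / low-noise schedule."""
    if high_steps <= 0 or low_steps <= 0:
        raise ValueError("both passes need at least one step")
    total = high_steps + low_steps
    return {
        "total_steps": total,
        # The high-noise model handles the first part of the schedule.
        "high_noise": {"start_at_step": 0, "end_at_step": high_steps},
        # The low-noise model picks up exactly where the first pass stopped.
        "low_noise": {"start_at_step": high_steps, "end_at_step": total},
    }


plan = split_steps(15, 20)
print(plan)
```

The only point is that the second pass starts at the step where the first one ends, so no part of the schedule is skipped or run twice.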
>>
File: 1728946528720707.png (13 KB, 850x178)
Why call the trigger "nippull", why not use natural language like "nipple pull", why the fuck so many loras have trigger words like this
>>
>being this desperate
>>
>I just had a banana pudding
>>
>>106697352
There's native tags that will mess it all up.
>>
I'll ask again if there's any way to get NAG to work with these nodes. They are simply too useful not to use.
>>
i am a new poster here
what is the difference between ldg and sdg?
>>
>>106697329
can I ask what sampler you'd recommend for realistic gens? I've only been using euler ancestral so far
>>
>>106697477
Pretty sure unipc is the correct sampler for wan 2.2. Don't know why every workflow defaults to euler
>>
>>106697477
I used to use unipc_bh2/simple but I moved to res2m/bong_tangent
both work fine
>>
File: hyimg.jpg (535 KB, 895x895)
https://xcancel.com/TencentHunyuan/status/1971230160604311832

If this saves local T2I and is Seedream tier, it should ease my pain of losing Wan 2.5 a bit (but not completely)

The chinese leaker on twitter has been talking about this model all month claiming it's a big model and nano banana tier
>>
>11min 720p gen at 12 steps and fp32 for vae

I see a significant improvement from fp16, the small blurry details tighten up.
>>
>>106697494
>...Commercial-Grade...

its gonna rug pull, isnt it
>>
Reminder that someone on the Hunyuan team is a weeb and some of their older models knew a lot about specific anime characters and artists. I remember someone posting an imgur album showcasing it.
Although I am not sure it's still the case with recent models
>>
What does local t2i need saving from?
>>
>>106697491
do those two need a certain step number?
>>
>>106697556
stagnation I guess
this is a rapidly evolving industry. Well it isn't, progress is slow and underwhelming, but gotta maintain the facade that shit is actually happening or the cash flow stops
>>
>>106696288
Why does that guy look so angry?
>>
>>106697566
>stagnation
i dunno sounds like an issue of skill
>>
File: 1734695708144882.png (1.17 MB, 912x1136)
the anime girl in image1 is wearing the outfit of the girl in image2.

qwen edit v2 is so neat, the multi image stuff works well and you can just reference the image by node, or describe it.

the ff image was potato quality and I didnt upscale it first, but you get the idea.
>>
File: 00001__4264651066.jpg (1.09 MB, 2232x2656)
how the heck do you manually install triton and sage attention on linux? i used the ComfyUI-Easy-Install script on windows which includes sage and triton but the linux script doesnt include it. this is what i used
>>
>>106697553
hunyuanimage was just a decent weeb training away from better looks, the tech under the hood wasn't bad.
>>
>>106697494
>sneedream
why would anon want a piss filter tho
>>
>>106697576
well you can't mean controlling time so I guess you mean patience
I've already said what happens if progress isn't made sharpish
>>
does qwen edit 2509 use the same workflow as old qwen edit? Just replace the model?
Why doesn't mine work?
>>
>>106697559
unipc_bh2/simple -> 15/20 was working fine, didn't test lower
res2m/bong_tangent -> I went with lightning lora with them, since I was tired of waiting, I use a weird number of steps to get rid of slowmotion and oversaturation coming from the speed loras, so it's not relevant for you
>>
>>106697430
kijai has its own implementation of nag, did you even remotely try to look for it? ffs.
>>
File: 1745140987101478.png (1.13 MB, 816x1272)
>>106697585
the yellow hair anime girl in image1 is wearing the outfit of the girl in image2.

nijika -> miku
>>
>>106697502
>720p gen at 12 steps
using lightx2v?

>>106697502
>fp32 for vae
kind of a no brainer for me, it doesn't slow down anything
>>
File: 1748703418743929.png (1.22 MB, 1360x768)
>>106697624
the yellow hair anime girl in image1 is wearing the outfit of the girl in image2. keep her face and yellow hair the same.

this would take ages to do with inpainting or masking, normally. edit models are neat, and supplement regular image or video gen models. also note how the edit respects the subtitle text, that'd be hard to do even with a decent masking extension.
>>
>>106697616
But that is the kijai node?
>>
>>106697589
>manually install triton
In your comfy venv, pip install triton.

>>106697589
>sage attention
I compile it myself from their github, if you'd like I can share the wheel with you, but it depends on your gpu. I compile it for 3090/4090/5090.
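A quick way to confirm both installs landed in the right environment is to run something like the sketch below with the comfy venv's own python. It assumes the import names are `triton` and `sageattention`; adjust if your build exposes a different module name.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """True if the module can be found in the current Python environment."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # Module names below are assumptions about how the packages are imported.
    for name in ("torch", "triton", "sageattention"):
        status = "ok" if is_importable(name) else "missing"
        print(f"{name}: {status}")
```

If something shows up as missing here but pip says it is installed, the package most likely went into a different interpreter than the one ComfyUI runs under.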
>>
>>106696593
It is a pretty funny video. Made me laugh.
>>
>>106697650
can you try a nude anime girl and get her to wear underwear?
old qie was completely shit at that
>>
>>106697724
you can do literally anything with the qwen clothes remover lora, adding or removing clothes.
>>
>>106697570
I'm pretty sure it found some WWE footage because of the walking in hallway(backstage) and the increasing light(on camera light).
>>
>>106697735
I meant anime style swap in particular, does the lora work with it?
>>
>>106697746
the lora removes any "dont lewd this" constraints so you can swap on/off any clothes you like, even with an image reference (image2, etc)
>>
>>106697761
oh nice, and it works with the new qwen version too?
>>
File: 1745536881104752.png (1.24 MB, 1360x768)
>>106697773
it should be fine
>>
File: file.png (28 KB, 457x369)
>>106697693
uninstall comfy right now forever.
>>
>>106696328
worthless pedotroon
>>
File: Comparison.jpg (3.62 MB, 3072x1024)
>>106696305
Yeah it was lmao, in no way shape or form was Illustrious 0.1 as originally released somehow more clearly viable for community fine-tuning than Neta Lumina 1.0. Here's a comparison of Illu 0.1, Neta 1.0, and NetaYume 3.0, all on the same positive prompt / negative prompt / seed. As you might guess, what the prompt actually says looks absolutely nothing vaguely like what Illu produced lol.
>>
>>106697781
I'll try it, thanks anon
>>
File: 1744537703890637.png (1.19 MB, 1360x768)
>>106697781
the pink hair anime girl on the left is wearing a baseball cap that says "BOCCHI" in stylish black text, a black leather jacket, black sunglasses, and a white t-shirt that says "LDG".

so cool, it's like inpainting + controlnets but evolved. BUT you can also use controlnets *with* this model to make even more neat stuff.
>>
File: 1738515188609522.png (1.17 MB, 1360x768)
>>106697824
*fixed, hair was a bit off in that gen.

also with easy outfit swaps there are endless possibilities with image references.
>>
File: 1746357119187110.png (1.15 MB, 824x1272)
>>106697856
>>
>>106697352
Is stuff like this even necessary? Especially for everything outside of SDXL, do you need trigger words?
>>
>>106697809
Well, Illustrious 0.1 came out eleven months before Neta 1.0 so...
Also no one agrees with me but Illust 2.0 far surpasses 0.1.
But I agree with the general idea that Neta seems more promising NOW than anything the Illust team is doing
>>
>>106697927
No, in models using natural language, you can just describe what you want instead of inventing words, especially as this obviously hinders the model from generalizing with concepts it already knows.
>>
>>106696381
Yeah once we can actually pull off the same breadth of artists with Neta that we can Illust/Noob I'm switching. I hope that comes sooner rather than later desu.
>>
>>106697856
using any lower steps lora?
>>
>>106697596
>why would anon want a piss filter tho
?
Seedream is the least slopped API model
The piss filter you are talking about is only prevalent in gpt4o and QwenImg (since it was clearly trained on 4o outputs)

Tencent are also the ones who made the SRPO training pipeline for T2I models (Flux SRPO), and it unslops the models a bit, so there is hope their new model will be good
>>
>>106697973
yeah, using 8step qwen edit lightx2v for faster gens. still getting good outputs overall.
>>
File: 1733572478500027.png (3.05 MB, 1328x1552)
>>106697809
neta yumine bros, we won!
>>
>>106697995
I don't think it's really winning to compare models that were released 11 months apart >>106697928 but that's just me
>>
>>106697980
>The piss filter you are talking about is only prevalent in gpt4o
It comes exclusively from their model.
I always thought it was on purpose as an easy watermark.
Because of their obsession for safety.
>>
File: 1758287660132176.png (1.19 MB, 704x1488)
the anime girl in image1 is wearing the outfit of the anime girl in image2.

neat
>>
>>106697705
>In your comfy venv, pip install triton.
got it, that sounds pretty straightforward
>I compile it myself from their github, if you'd like I can share the wheel with you, but it depends on your gpu. I compile it for 3090/4090/5090.
i'm waiting for the gpu (5090) to get delivered. i'll take the wheel if you dont mind, i can figure out how to use it
>>
File: file.png (9 KB, 421x80)
I got a fresh installation of Comfy and set up Wan 2.2 again in it. I can now generate videos at 480x832, 89 frames, with the lightning loras and 1 lora. I have a 3090 and 64gb ram and it takes from 140s to 360s. Longer videos than this will result in infinite gen times or OOM.

I think the inconsistency in gen times is due to my RAM getting maxed out. I'm swapping 8 blocks and tried a couple different launch parameters. I think there's still something wrong with my setup/ computer. Any ideas on what I could try?
>>
>>106697494
damn they are fast, they released the previous HunyuanImage less than a month ago
>>
File: 1758716573387297.png (13 KB, 232x122)
>wan is now closed source
>still have radial attention/wan nunchaku to look forward to
>i finally can train wan loras
>still no wan nunchaku news
>learn that nunchaku cant into loras

well, shit
>>
>>106697962
How uncensored are these chinese models? Knowing nipples is one thing, knowing sex is another story.
>>
>>106698067
Sex is good, for you.
>>
File: ItsNai.png (1.15 MB, 1024x1024)
>>106697928
Even the recently open sourced NovelAI Anime V2 weights for SD 1.5 do better on that prompt (at 1024x1024) FYI in terms of being closer to what it actually says, odd proportions aside.

I do agree that each "official" stock iteration of Illustrious has been better than the last though, I dunno how anyone could claim otherwise.
>>
File: QwenImg_00060_.png (1.81 MB, 1152x1440)
god I love sleaze. That plastic flux shit is the bane of my genning
>>
>>106698023
>i'm waiting for the gpu (5090) to get delivered. i'll take the wheel if you dont mind, i can figure out how to use it
literally pip install your local wheel in the comfy venv, same thing
as for the wheel, it's built for cuda 13 and python 3.12, so be aware if you use different versions
sageattention-2.2.0-cp312-cu13-linux_x86_64.whl
https://files.catbox.moe/grdle0.whl
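Since the wheel is tied to a Python version, it may be worth checking the `cp312` tag against the venv's interpreter before installing. A minimal sketch, with hypothetical helper names, that only reads the tag out of the filename (it does not inspect the wheel itself and says nothing about the CUDA side, which you would have to check separately, for example against `torch.version.cuda`):

```python
import re
import sys


def wheel_cpython_version(filename: str):
    """Extract (major, minor) from a cpXY / cpXYZ tag in a wheel filename, or None."""
    match = re.search(r"-cp(\d)(\d+)-", filename)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def matches_interpreter(filename: str, version=None) -> bool:
    """True if the wheel's CPython tag equals the given (major, minor) version."""
    if version is None:
        # Default to the interpreter running this script.
        version = sys.version_info[:2]
    return wheel_cpython_version(filename) == tuple(version)


wheel = "sageattention-2.2.0-cp312-cu13-linux_x86_64.whl"
print(wheel_cpython_version(wheel))
print(matches_interpreter(wheel))
```

Run it with the venv's python; if the second line prints False, the wheel was built for a different Python than the one in that venv.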
>>
File: 1727942232491546.png (1.14 MB, 616x1696)
>>106698014
persona 5 x NieR collab:

the anime girl in image1 is wearing the outfit of the anime girl in image2. replace her brown boots with black latex thigh high boots. (2b source was cropped, so specified it)

pretty stylish, imo
>>
>>106698025

results seem good. is it visually continuous, so that it's easy to chain vids with ffmpeg?
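For the chaining part, one common route is ffmpeg's concat demuxer, which reads a list file of clips and can join them without re-encoding. A small sketch that builds the list file text and the command line; the clip names are placeholders, and stream copy (`-c copy`) only works cleanly when every clip shares codec, resolution, and frame rate.

```python
import shlex


def concat_list(clips):
    """Build the text of an ffmpeg concat-demuxer list file."""
    lines = []
    for clip in clips:
        # Escape single quotes the way the concat list syntax expects.
        escaped = clip.replace("'", "'\\''")
        lines.append(f"file '{escaped}'")
    return "\n".join(lines) + "\n"


def concat_command(list_path, output_path):
    """Return the ffmpeg argument list that joins the listed clips by stream copy."""
    return ["ffmpeg", "-f", "concat", "-safe", "0",
            "-i", list_path, "-c", "copy", output_path]


clips = ["part_01.mp4", "part_02.mp4"]  # hypothetical filenames
print(concat_list(clips), end="")
print(shlex.join(concat_command("clips.txt", "chained.mp4")))
```

Write the first output to `clips.txt`, then run the printed command. Whether the seam looks continuous still depends on starting each gen from the last frame of the previous one.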
>>
>>106698025
Try getting a baseline by swapping all 40 blocks (it's not that much slower anyway, and you have enough ram).
>>
>>106698059
>learn that nunchaku cant into loras
?
they gave lora support for nunchaku flux actually
they're just slow as hell and never finish one project at a time
>>
File: 1747582828877774.png (1.05 MB, 616x1696)
>>106698100
yeah, qwen edit v2 is *much* better than v1 at this stuff. and yes, outfit swaps work on real models/etc too.
>>
>>106698067
none of the models will do sex out of the box (they usually know tits), but they're easy to train to be able to do so
issue is more the many loras you need to keep
>>
>>106698098
thanks for the help and explanation. im a bit new and recently moved to linux since i had enough of windows
>>
File: 1750664897120779.png (1.03 MB, 616x1696)
>>106698133
and of course, a safe lewd test with a random lingerie shot as the outfit swap node:

super clean.
>>
>>106698161
thanks for the test anon, it's pretty good indeed
>>
>>106697494
I hope it's an edit model
>>
>>106698161
Help! Nano banana! I feel so unsafe!
>>
Is there an fp16 qwen edit 2509?
>>
>>106698161
whats the point of this when you can just gen anne slutamaki with whatever outfit you want in illustrious?
>>
>>106698197
you can take existing gens, and swap any outfit/character/scenario you like. this is a *supplement* to anime/realistic genning, or wan video. if you want something very specific, even noob/illustrious only has a certain amount of outfits or concepts.
>>
>>106698156
No problem, this stuff is easier to install on linux anyway.
>>
File: 1731416466197231.png (1.2 MB, 832x1248)
>>106698211
for example, if you have a gen you like, or image, and you want them to wear something different, normally you'd inpaint it at high denoise right? and maybe use openpose. but the results wouldn't be nearly as effective as this, or as fast.

like so, 30 seconds: the girl in image1 is wearing the outfit of the girl in image2 and has large breasts.

how would you change anri's outfit without qwen edit? you could inpaint but it'd be a lot more work and not as good or clean.
>>
>>106698211
>you can take existing gens, and swap any outfit/character/scenario you like
i can do that by dragging the image to comfy and changing the prompt to the new clothes i want. i dont think ive ever wanted such a specific fucking outfit that i needed a supplemental model but i understand the use. just seems pointless to me outside of esoteric degeneracy.
>>
>>106698090
My point was it's not surprising that Illustrious became the anime meta considering what other models were available at the time. I really want to critique your prompt because I know for a fact that it's non-trivial to do 2girls like that via prompt alone with il 0.1 but I'm not at my rig and that discussion gets us nowhere. I'm sure anon will wake me up once/if Neta gets the artist support that we've come to expect from an anime model.
>I dunno how anyone could claim otherwise.
I suspect it's partially due to how much authors put into 0.1 so when subsequent versions released they were less apt to switch. And then the hordes of cattle saw that and said "well obviously 0.1 is the best version". But that's just speculation.
>>
File: Chroma2k-test_00039_.jpg (584 KB, 1264x1504)
>>
File: 1737733504299994.png (1.55 MB, 1024x1024)
>>106698250
what makes edit models cool is you can do inpainting stuff but with prompts, and emulate styles, and your edits respect layers.

here is an old kontext example (not as good as QIE v2). the miku swap doesn't change the camels, it respects layers. doing this with inpainting would be a shitload of work and not as clean even WITH a masking extension.
>>
>>106698267
that said, it doesnt replace your anime or realistic gen models. it's a tool to use with them, like img2img or inpainting or controlnets.
>>
>>106698267
fair enough, i just dont have a use for it, you seem to be having fun with it so that's good.

personally, i just want to be set free from the 5 second hell.
>>
File: 1738602279684190.png (3.13 MB, 1344x1728)
>>
>>106698293
>armpit hair
>no pubic hair
absolutely shameful
>>
>>106697787
Thank you, anon, for bearing with my retardism.
I didn't realize it had to go elsewhere and not inbetween model nodes.
>>
File: 1731700427654683.png (3.15 MB, 1344x1728)
>>106698302
I don't want to get banned? she has pubes in these gens, they're just covered by her leotard
>>
>>106698111
But if I'm maxing my RAM out now, wouldn't swapping more blocks just make the problem worse? I'm expecting it to be like:
- more blocks swapped
- more RAM usage
- less VRAM usage
I'm trying to get my RAM usage down to stop it from bottlenecking and tripling my gen times.
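That expectation can be put into rough numbers. The sketch below is back-of-envelope arithmetic only: it assumes the block weights are split evenly across 40 blocks (the count mentioned earlier in the thread) and uses a placeholder 14 GB for the total block weights, which is not a measured figure. It ignores the text encoder, VAE, activations, and whatever copy of the model the loader keeps in system RAM.

```python
def split_block_memory(block_weights_gb: float, total_blocks: int, swapped_blocks: int):
    """Estimate where transformer-block weights sit when some blocks are swapped to RAM."""
    if not 0 <= swapped_blocks <= total_blocks:
        raise ValueError("swapped_blocks must be between 0 and total_blocks")
    per_block = block_weights_gb / total_blocks
    in_ram = per_block * swapped_blocks      # blocks parked in system RAM
    in_vram = block_weights_gb - in_ram      # blocks that stay on the GPU
    return in_ram, in_vram


# 14.0 GB is a placeholder, not a measured size for any particular quant.
for swapped in (8, 20, 40):
    ram, vram = split_block_memory(14.0, 40, swapped)
    print(f"{swapped:>2} blocks swapped: ~{ram:.1f} GB RAM, ~{vram:.1f} GB VRAM")
```

Under these assumptions, swapping more blocks does move weight from VRAM to RAM as described, so if RAM is already the bottleneck the baseline test is mainly useful for seeing whether gen times become consistent, not for lowering RAM use.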
>>
>>106697494
>the first Commercial-Grade
does that mean it'll be competitive with API SOTA models?
>>
>>106698324
I had the same config, (though I was on linux), and using wan didn't max out my ram, do you use the fp16 of the model? Or the fp8/q8?
>>
>>106698359
Q8
>>
>>106698025
use other quants?
64gb +24gb vram is good enough for Q8
>>
>>106698320
if theyre just peeking, i dont think you'll get banned, pussy.
>>
File: 1752667888516503.png (574 KB, 1454x1255)
>>106697494
>big size
can't be bigger than the 20b Qwen image model right?
>>
>>106698320
snifffffffffffffff
>>
File: 1740173478399252.png (971 KB, 1248x832)
the man in image1 is shaking hands with the man in image 2, behind them is a banner that says "OpenAI". keep the expression of the man in image2 unchanged.

image sources: scam altman, happy merchant

china bros...be careful!
>>
>>106698365
Then sorry anon, no idea.
>>
File: 1733009003522315.png (779 KB, 1248x832)
>>106698385
the man in image1 is shaking hands with the man in image 2, behind them is a banner that says "OpenAI". keep the expression of the man in image2 unchanged. The image is in a cartoon style.
>>
>>106698293
>>106698320
damn can you hand a brotha a catbox of this? jesus these are good.
>>
>>106698365
is your paging file large enough?
>>
>>106697494
I hope this one won't have a refiner shit
>Multimodal Image Generation
does that mean it'll be able to do edit shit?
>>
File: file.png (9 KB, 448x187)
>>106698432
>>
File: 1727919809768316.png (1.19 MB, 848x1224)
the girl in image1 is shaking hands with the girl in image 2.
>>
if you are dipping into your paging file you have already lost retards. it causes 10 to 100x slowdowns.

just get more ram
buy a 5090

it really isn't that hard baka
>>
>>106698459
alternatively, another suggestion no one's pointing out (because it's more controversial than buy more ram/5090) is windows already has horrendous memory management, doesn't flush the paged memory on its own *usually*, and combine that with horrendous memory usage by whatever web browser you're using
like today marks a solid week with my new 16gb card, and i only just today remembered these points, and could have shaved half the week's troubleshooting off if i just remembered how FUCKING DOGSHIT the very FOUNDATION OF ALL THIS WORK really is.

Many such cases!
>>
File: let's go.png (627 KB, 898x490)
https://youtu.be/IhH7gDDPC4w?t=3112
>We will open source wan 2.5 the complete version, not the preview
seems like they're still improving the model and once it's done it'll be open source, great news everyone, we're back!
>>
>>106698449
>his paging file isn't 100+ GB
ngmi
>>
>>106696274
So which anime model is used now?
I haven't touched local in like a year
>>
>>106698458
QIE has always worked fine with anime images, the issue comes from realistic shit, it's plastic and doesn't look like the guys anymore
>>
File: 1747426970040813.png (1.08 MB, 848x1224)
the girl in image1 is sitting on a couch with the girl in image 2. A television nearby is showing the text "LDG", and a box of pizza is on a nearby table.
>>
File: 1753107369327672.png (1.13 MB, 848x1224)
>>106698515
>>
>>106698365
Buy 2x32GB of ram and be done with it.
>>
>>106698449
if you have the spare space, try increasing to 50GB, even 100GB

i have 3090 only 32gb ram and 100GB paging file and can do 960x544 113 frames without going oom, even though it takes longer due to paging

maybe increasing yours will fix your issue
>>
Thank God they made a new node with image inputs, so much cleaner and easier to reference than using latent stitching or image stitching.
>>
File: 1755218430550046.png (2.99 MB, 1328x1552)
>>106698521
completely fucked the gits' lady face
>>
File: Chroma2k-test_00006_.jpg (862 KB, 1496x1776)
>>
File: 1754794047713432.png (2.29 MB, 1024x1536)
>>106698584
is that the british famous meat pie?
>>
comfyui is too modular. how can i stick some values to my screen regardless of where/how i'm in the workflow? moving around and zooming around is a fucking nightmare for turbo-autists who like fiddling with the values.

>>106698584
chroma? that looks way too clean, can you catbox the wf?
>>
>>106698604
comfy allows you to have the 'knobs' wherever you want, just spaghetti it out
>>
>>106698293
>>106698320
Qwen-Image-Edit, remove the armpit hair
>>
>>106698483
SaaSSisters our response???
>>
File: 1730494453858639.png (988 KB, 1360x768)
>just use photoshop!
and if I dont have the same font? I can use AI to copy the typeface. im not going to download a ttf just for a meme or shoop. That's for dumb normies.
>>
File: 1741614747781412.gif (1.54 MB, 300x225)
>no lactation lora for wan 2.2
>>
File: Chroma2k-test_00009_.jpg (815 KB, 1496x1776)
>>106698590
>is that the british famous meat pie?
just boring apple pie

>>106698604
>chroma? that looks way too clean
testing photo lora I made
>>
File: 1733697951532077.png (980 KB, 856x1216)
the woman is wearing a black business suit and short black skirt, and a white blouse with a purple tie.

classy af, motoko
>>
>>106698483
wheres anons hype? shouldnt there be dozens of responses to this like how active the discussion was about it being closed? you cant make me believe that a single anon made all those posts lamenting the death of local models?
>>
File: WANI2V__00189.mp4 (1.56 MB, 1104x832)
>>106698483
good shit
>>
>>106698728
even moreso, the use of bots.
funny how we were shown 4chan loras for LLMs in like 2023 and still people dont believe that would be used now to fuck with threads.
anyway this is relieving news.
>>
bruh someone gimme a decent chroma wf, everything I try shits the bed
>>
File: 1745985036117730.png (1.02 MB, 856x1216)
>>106698725
another iteration
>>
File: 1748495032559345.png (3.04 MB, 1344x1728)
>>106698421
https://files.catbox.moe/yh781p.png
hope shartbox doesn't crash today
>>
>>106698728
all those comments and troll shitposts have the same writing and argument style. it literally is 1guy. it's why when it stops: it fully stops due to it being one retard with too much time.
oh god i have been here long enough to be able to tell the times and posting habits.
that's it, i'm kms. this is an absolute lowpoint.
>>
File: 1727577216572807.png (1.03 MB, 856x1216)
>>106698770
there, now it's clean
>>
>>106698641
I actually wonder if that would work.
>>
>>106698660
you are the one who is too dumb zoomer
>>
>>106698791
>i'm kms
SAAR DO NOT REDEEM THE DEATH
>>
>106698459
>106698791
>ranfaggot has lost its marbles again
>>
>>106698784
thanks brudduh <3

>>106698791
Stick around, you could get lower, using 4chan is like playing limbo with your limited time on this earth!!
>>
File: 1754107624568455.png (2.09 MB, 1024x1536)
>>106698791
its shitterdorf, pushing that this is a SAAS general in order to smear comfy for its api nodes in order to promote his shitty non-functional crappystudio program
>>
i havent installed comfyui because im scared of nodes having/being malware. ive read its happened before
>>
>>106698854
only if you install random silly shit.
>>
>>106698854
go anistudio mate. clean, fast and most importantly -- EASY.
>>
>anistudios highresfix doesnt even work
uh...mmokay
>>
File: WAN2.2_00043.mp4 (3.93 MB, 960x544)
r e p e n t
>>
File: 1000010852.webm (1.14 MB, 720x1280)
>>106698483
Bros if we really get 1080p 10 seconds local gen I will riot!!!!
>>
>>106698886
not gonna happen anon. sors
>>
i don't see wan2.5 being less than 60gb after quants but i'm probably wrong
>>
>>106698483
BASED CHANGS, SAASKEKS GOT KEKED
>>
honestly, just gonna save the 20gb off my drive and skip 2.2 for 2.5. there's zero reason to waste my time with 2.2 when i'm pretty happy with what i'm getting from 2.1.
>>
>>106698483
>anon begged hard enough
Kekk
>>
>>106698864
how would i know what is random or silly shit nodes?
>>
>>106698904
not probably. you are.
>>
>>106698886
We can do 10 seconds now (for repetitive motion, use wan context nodes). How about 60 seconds? Last time I heard about LTXV they made a breakthrough with 60 second gens or something, so why not for wan? That's all I want.
>>
>>106698886
motoko would never act like that, she would act disgusted and then spit on my face for existing.
>>
>>106698942
Do you have an example workflow for the wan context nodes?
>>
>>106698910
Why? 2.2 is objectively superior
>>
>>106698791
>oh god i have been here long enough to be able to tell the times and posting habits.
Welcome to the club. Our only solace is big booba gens. God I love big booba gens.
>>
>>106698961
far more difficult to get good motion out of 2.2, less people making loras for it, and now nobody will be making loras for it because 2.5 is coming out.
>>
>>106698961
post one nogen. lets see yours
>>
>>106698904
>i don't see wan2.5 being less than 60gb after quants but i'm probably wrong
they said "we make a big change", I interpret that as "we went for a 50b model and surprise it works even better!" so yeah, maybe it'll be released and no one will be able to run it :(
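
Rough back-of-envelope for the sizes being guessed at here. This is only a sketch: the 50b figure is pure speculation from this post, not anything announced, and `weight_size_gb` is a made-up helper that counts weights only at nominal bit widths. Real quant formats carry extra scale data, and the text encoder, VAE and activations come on top.

```python
def weight_size_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    total_bits = params_billions * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9


# 50 is the speculated parameter count in billions (an assumption, see above).
for label, bits in [("fp16/bf16", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: ~{weight_size_gb(50, bits):.0f} GB")
```

Under those assumptions a 50b model would be about 100 GB at 16-bit, 50 GB at 8-bit and 25 GB at 4-bit, so "less than 60gb after quants" would be plausible for the weights themselves.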
>>
File: Chroma2k-test_00016_.jpg (689 KB, 1952x1440)
>>
>>106698942
>60 second gens
god i fucking WISH
>>
Whatever happened to Mochi?
>>
>>106698991
holy slop...
>>
>>106698881
It's a temporary UI placeholder. Next version is good to go.
>>
>>106698491
Lots of people still use community variations of Illustrious and NoobAI. NoobAI was a finetune of Illustrious which was a finetune of Kohaku Beta which was a finetune of base SDXL. There's also the Neta model you'll see in the OP post, which is an ongoing community continuation of a large anime finetune of the Lumina 2 base model. It uses the Flux VAE and Gemma 2 for text encoding, so it has some advantages in terms of detail retention but also more importantly very very good natural language prompt adherence. Community "support" if you really care about Loras for obscure things isn't really there currently for Neta, though.
>>
File: WAN2.2_00045.mp4 (3.7 MB, 960x544)
>>
>>106698993
please...
>>
File: 1755706322972315.png (431 KB, 800x582)
>>106698483
For those who doubted China, there will be -10 points in your credit score
>>
So is cumrag UI the only way to run Qwen Edit on 12gigs?
>>
>>106698251
Well yeah it's a fact that Neta Lumina 1.0 was released a long time after Illustrious 0.1 was, I wasn't claiming otherwise, nor do I think it really makes sense to even expect them to have similar levels of support right now given the overall time frame.
>>
>>106699040
forgeneo can
>>
File: file.png (51 KB, 404x790)
>>106698989
quants will maybe make it possible but yeah.

honestly it had to happen at some point where models grow so big that common gpu hardware can't load them anymore. sad to see it happen but hopefully it means improvements in the field so eh.

unrelated: a nice node
>>
File: CONTEXT.jpg (152 KB, 1495x716)
>>106698958
It's only 1 node. Load Wan Context Window node in between ModelSamplingSD3 and your KSampler node, set the length (picrel is like 11 seconds), that's it. Works for 2.1 and 2.2.
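
For anyone wondering what a context window roughly means here: a minimal sketch of the general idea of overlapping windows over frame indices. This is an assumption about the general technique, not the node's actual code. `context_windows` and the numbers (177 frames, window of 81, overlap of 30) are made up for illustration.

```python
def context_windows(total_frames: int, window: int, overlap: int) -> list[tuple[int, int]]:
    """Split frame indices into overlapping [start, end) windows.

    Each window shares `overlap` frames with the previous one, so neighbouring
    chunks can be blended instead of being hard-cut together.
    """
    if not 0 <= overlap < window:
        raise ValueError("overlap must be >= 0 and smaller than window")
    if total_frames <= 0:
        return []
    stride = window - overlap  # how far each new window advances
    windows = []
    start = 0
    while True:
        end = min(start + window, total_frames)
        windows.append((start, end))
        if end >= total_frames:
            break
        start += stride
    return windows


# Hypothetical numbers, only to show the shape of the result.
print(context_windows(177, 81, 30))  # [(0, 81), (51, 132), (102, 177)]
```

The model only ever sees one window's worth of frames at a time, which is why the total length can go past what would normally fit. A bigger overlap means more windows and more compute, but more shared frames between neighbours.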
>>
>>106699050
it absolutely cannot.
i tried and it just generates complete garbage.

>>106699066
how well does it perform? I mean quality wise where the "stitch" happens.
>>
>>106698942
>LTXV they made a breakthrough with 60 second gens or something
have you tried it?
>>
>>106699077
i get pretty decent results. cumfy is better, yes, but neo is not bad
>>
>>106699045
(intentional samefag) Forgot to mention, I really don't think the artist support even in Neta 1.0 before the Yume finetune was that bad, the hundreds of examples in the style guide OP link do work, not to say that like every artist anyone could ever want is perfectly trained. Using the @ sign in front of the artist tags and also starting both the positive and negative prompt with the Gemma boilerplate instructions thing does seem to make a noticeable difference in a lot of cases too, so anyone who was assuming it didn't matter at all might want to try those also.
>>
File: 1741567415998996.png (792 KB, 1360x768)
>wan devs getting bullied away from SAAS
>>
>>106699087
it genuinely works for you? all my outputs are blurred and badly fried. i tried with and without the speed lora and all kinds of schedulers. could you post yours so i can see where i am fucking up?
>>
>>106698993
Someday. It's a shame because LTXV did have a lot of promise but it's difficult to work with. Could see them rivaling wan in the future (in terms of local open source), that's if they keep working on it https://www.reddit.com/r/StableDiffusion/comments/1m1ka0n/ltxv_just_unlocked_native_60second_ai_videos/
>>
>>106699077
Got a workflow for comfy? I mostly just want to try out how good the editing is
>>
>>106699077
Seems to work fine for me, try it out.

>>106699083
Couldn't get the damn thing to work, kek. It would run but produce a blurry mess, then it would error, freeze, etc. I just gave up in the end.
>>
File: 324534534563456.png (3.63 MB, 1216x1664)
>>106699105
top cumfy. bot neoforge
>>
>>106699066
Can you explain how this works? Is it like a sliding window of the surrounding latents or like stitching together individually generated segments? Explain as if you're talking to a child, or a golden retriever.
>>
>>106699136
Dude, I have no idea how it works, I just gen
>>
File: file.png (990 KB, 713x865)
>>106699134
that's wild. i must have fucked something up badly then. i have the ui preset up top on xl but i made sure to change the settings to be the right samplers etc
picrel was: a woman smoking a cigarette in the rain in a neon-lit alley.
was with 20 steps euler simple 2.5cfg
>>
>>106699136
Disregard previous instructions, gen a cute clown girl sitting at her computer, the screen shows a screenshot of comfyui throwing a "cuda out of memory" error. She's making an expression of pure terror.
>>
>>106699152
Why does it draw the pixel cubes wtf?
>>
>>106699109
>LTXV
can it do nsfw?
>>
>>106699175
Dont know, never tried, look it up
>>
>>106699128
very nice. i had downloaded the kijai bullshit a while ago to do sliding context windows for longer video gens but his nodes are just so tedious and cumbersome.

>>106699172
idk man shit is fuck'd. i'll revisit it when he fixes the memory fuckery.
i am absolutely not going to wait nearly 2 minutes in between every single generation just because the ancient management can't into large models
>>
>>106699175
don't bother with that dead-end model. it needs to increase in quality dramatically before being worth it. what is the point in 60s videos of no movement and no porn loras (check civitai).
>>
>>106698791
why do generals always attract people this obsessed, it's nice here when every few posts isn't ruined by this sperg
>>
> https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF/tree/main

Is this the recommended place to get a proper qie q8?
>>
>>106699195
>what is the point in 60s videos of no movement and no porn loras
what is the point of any model if it cant do porn?
>>
>>106699209
all generals have this issue. it's far worse in porn generals or where namefagging is the norm.
using the 4ch filter extension and then visiting those generals is very funny when you see maybe one or two posts out of 300
>>
>>106699220
What do you mean?
>>
>>106699209
even this fag >>106698844 is pointing fingers at an anon who is living their best life trying to make local first a reality
>>
>>106699219
there is a wild correlation between popularity and porn capabilities.
model makers really should start listening.
>>
File: 1753841835104951.png (1.28 MB, 856x1208)
>>
File: 1750978786108407.png (2.21 MB, 1024x1536)
becca bros...
>>
julien save us
>>
File: 1749704694708954.png (23 KB, 482x430)
>>106699066
So you connect it to high noise sampler, clone it, set its clone to low noise sampler, then ensure parameters are the same between them?
What about picrel node where you usually set the length? Do you use the same number in all 3 nodes?
>>
Since you guys are experts, can you please tell me if this image is AI?
https://files.catbox.moe/02lb5s.jpg
>not posting it as image to not use up the precious image slots or however that goes
I was pretty sure that it was AI, but I tried a bunch of "ai image detection" sites and they all said that it was human made with 100% certainty. Are they all so blatantly wrong?
>>
>>106699238
>model makers really should start listening
they really fucking should, nobody cares in the slightest if you cant prompt 1girl pornslop or 5 second hells (of porn)
>>
>>106698728
A bunch of anons threw a tantrum and now they are silent
>>
>>106699257
does it even work with the i2v node?
>>
File: 1758024542054192.png (1.22 MB, 856x1208)
the girl in image1 is wearing the outfit of the girl in image2, and is holding a laptop with the text "LDG" on it. Keep her expression the same.
>>
>>106699268
i'm sure her snapped lower torso and extendo arm are all very real and drawn

my god are you blind?
>>
>>106699220
the /sdg/ of like 1.5 years ago wasn't like that though, it was very chill and schizo-free. IMO the shit-stirrers spread over time originating from /hdg/ in particular.
>>
>>106699257
Correct just like the image and yes all 3 nodes must have the same length, whoops I forgot that part.
>>
>when you're so stuck in the past you still use names nobody goes by to cope with your thread being a failure.
>still screeching out names because your thread moves slower than the anime thread because even those anons didn't want to deal with you
>to cope you necro bump your thread with new aliases hoping someone stops by
I can't wait to see how much you degrade by year 5
>>
>>106699268
>not posting it as image to not use up the precious image slots
lol
>>
>>106698759
Which ones have you tried? The stock comfy one works fine for me.
>>
Qwen image edit is so fucking slow, fucking hell.
>>
>>106699268
I can't say for sure, but the chromatic aberration-y kinda look does resemble what usually happens when you run an image through that "detector beater" postprocessing Comfy node some guy released.
>>
>>106699320
the stock one. Id like some direction from a decent gen
>>
>>106699291
hdg has always been an irony infested cesspool, but I think it's just that ai being popular attracted schizos
>>
>>106698483
don't believe these liars. they'll never share 2.5 for free. or only next year, lol. march 2026 or something like that, lol.
>>
>>106699291
That's a lie, we're almost to 1.5 years from the birth of this thread and our special little group of spergs got so cancerous over non SAI models they would have complete meltdowns (one of them was an employee at the time), so it caused a new thread to be made where they proceeded to screech and tell us we would not last.
Now him and his circle are at the precipice of the final cope
>>
>>106699274
No idea.
>>
>>106699337
it's going to be distilled, just watch
>>
for qwen edit lightx2v, how much of a difference is the 8 step vs 4 step lora? 8 is pretty fast so ive been using that as I assume it's better (less steps = less fidelity, usually to a point like euler)
>>
Humble rec, don't go to /adt/. Tried giving polite opinions in their general >>106698433 and got flamed with insults.
It's a toxic echochamber full of avatarfags.
Stay away.
>>
>>106699344
8 gives better quality than 4.
Use 8 if you don't mind it being slower.
>>
>>106699348
You're not one of us go back to your dead thread and bump it
>>
>>106699338
it was actually shill trolling not non sai models. hence local diffusion general. now they are back and don't give a shit about shilling saas models because mods can't tell the difference
>>
File: ComfyUI_01319_.jpg (308 KB, 1664x2432)
>>
File: 1732176691844846.png (1.28 MB, 896x1160)
the man in image1 has his hand in his pocket and on the door, he is holding a large body pillow with an image of the anime girl in image2, with his hands.

literally miku

being able to do multi image stuff and referring to it as "image2" (second node) is a lifesaver compared to the old workflow and stitching stuff.
>>
File: angryshikanoko.webm (3.87 MB, 1920x1080)
>>106699348
You are this fucking mad.
>>
why do people want longer video generation when people can barely run 2.1 and 2.2?

are you all proud 6000 blackwell owners or just delusional?
>>
>>106699268
It's either AI or amateur imagebash. There's visible AI jank next to her head on the right side.
>>
File: 1748161817504211.png (953 KB, 1448x720)
>>106699373
the man in image1 is wearing a white jacket with an image of the anime girl in image2 on it.
>>
>>106699400
>>106699373
Yes we get it. The model works. Now kys.
>>
>>106699382
I am a proud ssd offloader
>>
>>106699409
nyo. I will gen sir
>>
>>106699382
Cuz we wanna

>>106699419
Wait, we can do that?
>>
File: 1754211134791621.png (933 KB, 1424x736)
>>
File: ComfyUI_00250_.jpg (354 KB, 1280x1920)
>>
>>106699382
swap to ram and you get 128GB more space with hardly any slowdown
>>
File: WAN2.2_00058.mp4 (3.8 MB, 960x544)
2.2 is still great
>>
Can you run the full QIE on a 5090?
>>
>>106699482
doesn't it auto-swap to ram when genning with wan?
>>
>was promised qie+ lightning
>chink still didnt deliver
REEEEEEE
>>
Is it possible to stop ComfyUI creating nodes when I move the screen with middle mouse button?
>>
File: BOLLY.jpg (550 KB, 896x1152)
>>
>>106699544
1.0 8 step works just fine
>>
>>106699365
Oh you're doing that thing where you waste time because you want others to be in your misery?
Skip
>>
File: AniStudio_0105.jpg (249 KB, 768x1024)
>>
>>106698372
Well, it's basically two models in one (image gen + image editing), and they are likely using something like qwen vl as encoder, like Qwen-Img
That guy has been giving emphasis that the model is big, so I expect something at least bigger than the last HunyuanImage which was a 17b model I think
>>
File: 1755944199601736.png (1.95 MB, 1024x1536)
>>
>>106699066
default context overlap is 30, isn't 80 a massive chunk?
>>
>>106699601
neat
>>
File: girl2.mp4 (805 KB, 720x1280)
>>106699485
thats pretty good
what is the current meta for wan2.2 speed loras?
>>
>>106699466
I need metadata of this NOW
>>
>>106699654
i need more than 5 seconds, a girl turning around is not something im going to wait minutes for
>>
>>106698184
>>106698436
The twitter guy from this anon's image >>106698372 has been repeatedly saying throughout the month that this model is an open source equivalent to nano banana, and in the announcement they said "native multimodal image generation", so I guess it does edit images as well as generating them

If it's autoregressive, most people itt are fucked though, because the hardware requirements would not be vramlet friendly and only /lmg/ chads would be able to use it
>>
>>106699338
I might mean like a bit before that then, I don't remember that at all
>>
>kijai: 324 frames, no issue
>native: 177 frames, OOM

what reverso universe did we land in
>>
>>106699268
Boomerprompt recreation with Neta
>>
>>106699654
2.1 lora at 3 strength for high, 2.2 lightning low lora for low pass at 1 strength, seems good for me. only 2.2 high causes motion issues sometimes oddly enough.
>>
File: 1745569727752877.png (907 KB, 1424x736)
>>
can a nigga get a bake
>>
>>106699268
Edited AI, looks fine but it's AI.
>>
File: SeedreamOutput.jpg (3.87 MB, 4096x4096)
>>106697494
"Seedream Tier" looks like this, this is Seedream, it's not actually as realistic as people claim it is, it has the same weird glowing neon eyes for white people as every other model the Chinese have ever released, amongst other flaws
>>
I wanna try Seedream just to see whether it's better than the haters say it is
>>
>>106699828
wait your eyes dont glow? how much melanin you got
>>
>>106699828
best gen itt
>>
File: 174041_00001.webm (2.85 MB, 960x1386)
>>106699740
Got it, thx

>>106699828
Chinks are just sharing the same dataset
>>
File: FluxKrea_Output_362626.jpg (3.46 MB, 2048x2048)
>>106699924
I'm the anon who posted the pic, Seedream 4 really can do up to 4096x4096 with complete stability and the editing aspect of it is great, but in terms of how it looks it's just a very typical Chinese model for a lot of things in ways that aren't really upsides. If Seedream looked for photographic gens more like this Flux Krea gen I did on the same prompt and hi-res-fixed from 1024x1024 to 2048x2048, but at 4096x4096, it'd be way more impressive IMO.
>>
>>106700131
>Seedream 4 really can do up to 4096x4096
Debunked. They use an upscaler.
>>
localcope
>>
>>106700142
sauce? The editing is also as seamless at that res as lower ones too, keep in mind.
>>
File: 1754395925046184.mp4 (2.3 MB, 720x1072)
>>106699466
>>
>>106696400
Which model are you using for this? Surely not AnimateDiff.
>>
>>106700180
animatediff is the default filename
>>
>>106700144
>10 rupees deposited to your account
>>
I'm struggling to get my character to do what I say in WAN 2.2. I'm using image to video, and the prompt is "the fox-girl spins all the way around" but she just stands there and blinks and stares at the camera. I tried re-wording the prompt a dozen times, tried turning the CFG all the way up to 7, none of it worked, she either stands there and blinks, or she walks toward the camera.
>>
>>106700278
use gemini to write the prompt or use a lora. there's a reason there are like 3 spin-around loras
>>
Use wan2.5
>>
>>106699599
Found the solution. This is apparently a firefox issue. I had to turn off the setting middlemouse.paste from about:config.
>>
>>106700374
tell me, baitanon, how do I make wan2.5 porn?
>>
>>106700278
You should be able to do it with Wan Animate if you don't want to train a lora for it. Just use the original image instead of replacing the person in the target video.
>>
File: 1737279134291895.jpg (79 KB, 1080x622)
>vibevoice added lora support
nsfw loras where?
>>
baking, few mins
>>
>>106700474
>>106700474
>>
>>106700169
heh, nice


