[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Six Gorillion Lines Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106666599

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 00607-3573990253.png (2.21 MB, 1824x1248)
2.21 MB
2.21 MB PNG
someone like me shouldn't have the power to make these images in less than a minute at hireses

>>106669779
not bad
>>
File: 1753890205042292.png (380 KB, 1920x1080)
380 KB
380 KB PNG
Qwen Image Edit PSA

Always add:
"without changing anything else about the image"
at the end of your prompts if you want to preserve anything at all from the original image

Also here's a great workflow for the old Qwen Image Edit model
https://files.catbox.moe/6wcz4m.png
>>
File: file.png (923 KB, 1120x928)
923 KB
923 KB PNG
>>106669789
You dropped >>106669779
>>
File: 00002-3703196870.png (2.32 MB, 2048x816)
2.32 MB
2.32 MB PNG
>>106669789
blessed thread of friendzoned ;3
>>
>>106669808
saving this b4 janjan delet due to nippies
>>
File: 1745542616215011.png (770 KB, 1120x928)
770 KB
770 KB PNG
the destiny of every ShillAI employee:
>>
File: 1734602400556950.png (781 KB, 1120x928)
781 KB
781 KB PNG
>>106669829
cleaner

like Microsoft's asshole after Sam licks it
>>
https://huggingface.co/dannygroove666/Qwen-Image-Edit-2509_fp8_e4m3fn.safetensors/tree/main
there's the fp8 version
>>
ookay ookay bub we get it
>>
Can someone give me a quick rundown in this "AniStudio" thing?
>>
>>106669859
It's the best frontend for generating everything that's hated only by one schizo transphobe itt.
>>
File: 00622-4127018980.png (1.91 MB, 1248x1824)
1.91 MB
1.91 MB PNG
>>106669823
yknow what, you might be a tripfag, and a kinda annoying one, but knowing you're also banned from civitai makes you a brother.

>>106669827
eh they're clothed it should be fine. i had to debate whether to post 99% of the wedgie gens i did because they seemed to toe the line.
of course if i'm wrong, well. whoop. lol.
>>
>singular schizo theory
>>
File: RA_NBCM_00004.jpg (744 KB, 1872x2736)
744 KB
744 KB JPG
>>
>>106669891
>brother
we are all brothers, on earth, in christ's love.
>>
With the new qwen edit local finally reached 1/4 of Seedreams power. Impressive
>>
>>106669891
What base model are you using for these? I'm impressed by the expressiveness.
>>
>>106669871
Thanks. I really like to try it. Do you have the link perhaps?
>>
File: 00631-3178153112.png (2.04 MB, 1824x1248)
2.04 MB
2.04 MB PNG
>>106669912
>we are all brothers, on earth, in christ's love.
hard as fuck man a-fuckin-men.

>>106669920
wai-nsfw v140 with an expression lora, "one piece funny face".
and 20 hires steps + adetailer for the face
>>
File: ohHELLnoo.jpg (59 KB, 741x533)
59 KB
59 KB JPG
>>106669811
>>106669773
where do i put this shit nerds???? I AM ALMOST DONE NO THANKS TO YOU
>>
>>106669930
Of course! https://github.com/FizzleDorf/AniStudio/
>>
>>106669952
you turbo nigger retard, read it, it literally tells you the folder paths right there
>>
>>106669952
learn to read the op next time, jamal
>>
>>106669952
>comfy
> in 2025
>>
I spoke to John Qwen and he said wan going forward will be API only.
>>
another try with new QIE
>prompt = "the anime girl from image 2 is standing in the foreground of image 1, looking back."

It's also not a very good result.
Maybe I'm asking too much.
I'm supposed to be doing work so I can't spend more time looking for better pictures to test, but I'll try a few more later.
>>
>>106669952
you're almost there! be sure to post the vids you make :3
>>106669942
<3
>>
anistudio doesn't have this problem btw
>>
>>106669952
>wan 2.1
>>
>>106669985
When I posted some images from the last thread I was getting victim blamed for follow the instructions on the website. It's a step up, but it's not perfect. That's for sure.
>>
File: 1748610647021479.png (922 KB, 1120x928)
922 KB
922 KB PNG
qwen edit is a great pepe generator btw
>>
>>106669985
it completly changed the background, it's so bad
>>
>>106669789
wait waht? how do you combine a video and pictures? is that some new formate?
>>
>>106669912
pedo christcuck
>>
>>106670064
a video is a set of pictures, so it's just repeating the same picture over and over during the video
>>
File: 1742426960544140.jpg (54 KB, 1000x720)
54 KB
54 KB JPG
>>106670088
>>
File: 1748200804103691.png (946 KB, 1120x928)
946 KB
946 KB PNG
>>106670038
>>
File: 1732211505032623.jpg (772 KB, 2016x1152)
772 KB
772 KB JPG
>>
File: WanimateCollage_00008.mp4 (3.71 MB, 1048x1440)
3.71 MB
3.71 MB MP4
Wan Animate is good at 3D to 3D. Trying to use 2D reference to 3D or vice versa gave bad results.
>>
File: 00681-1230519174.png (2.05 MB, 1824x1248)
2.05 MB
2.05 MB PNG
>>106670120
dunno why this in particular made me wheeze laugh but here i am, giggling again.
>>
>>106670157
dead on arrival
>>
>>106670157
How about 2D to 2D?
>>
>>106670174

Don't have any 2D reference video on hand. I'll try to find something to test.
>>
>>106669952
can i skip the 30gb one and just put my own checkpoint in that i have already tested? this shit is takling too long i dont know how you guys have patience for this shit
>>
https://huggingface.co/calcuis/qwen-image-edit-plus-gguf/blob/main/qwen-image-edit-plus-q4_0.gguf

slowly getting to q8, might just try this for now
>>
>>106670161
I Fairr,stockng wis die thary linge!
>>
Wan2.5 should have similar requirements to 2.2 right? Already pushing the limit of my 4090 here
>>
>>106670120
brainlet, you do know videos can contain still frames yes?
>>
>>106670202
>might just try this for now
why not going for fp8 instead >>106669844
>>
>>106670246
Nobody knows yet, but I expect it to be a Veo 3 competitor.
>>
>>106670261
404, it's gone.
>>
>>106670276
I screenshotted this image and sent it to oxford dictionary so they can use it in their picture dictionary under the word "cope"
>>
>>106670277
still here
https://huggingface.co/dannygroove666/Qwen-Image-Edit-2509/tree/main
>>
>>106670276
lol
>>
>>106670297
wtf the other one is 40gb
>>
>>106670325
well yeah it's a 20b fp16 model
>>
>wan2.5 is releasing mid 2026 and people haven't even fully migrated to wan2.2 yet
imagine taking a break from ai for a few months. you'll be so behind you may as well be starting new
>>
So there is no way to know the trigger word in civitarchive loras unless it is one of the few ones archived with that knowledge included, right?
I wish they included civit links instead of just saying deleted, maybe could have tried web archive or something.
>>
File: file.png (83 KB, 1448x723)
83 KB
83 KB PNG
> kijai nodes
> "simple" i2v wf

absolute disgrace
>>
>>106670346
civitarchive is fully vibecoded and has been abandoned for months. don't expect it to work.
>>
File: 1634425668128.jpg (94 KB, 471x388)
94 KB
94 KB JPG
>>106670157
it's fucking better using dance on i2v. animate is so bad
>>
>>106670364
Do you know a non-abandoned alternative?
>>
>>106670355
it's really not that complicated
>>
>>106670375
maybe not complicated but there are far too many moving parts that are just so unneeded
and it looks messy.
if native had context windows for wan i would delete all his nodes
>>
>>106670372
My local backup
>>
>>106670235
WHEEETEEEZE
>>
>>106669789
Catbox on the second from the left on the top please?
Is there any way to use an image as reference to get the same style/outfit on new gens?
>>
>>106670334
>wan2.5 is releasing mid 2026
Niggas here were saying the 24th of this month
>>
>>106670346
pretty much. if the lora has no metadata then it is impossible to know.
>>
>>106670157
Workflow?
>>
File: RA_NBCM_00007.jpg (1 MB, 1872x2736)
1 MB
1 MB JPG
>>
>>106670408
it's supposed to be tommorow or in 2 days yeah lol
https://xcancel.com/bdsqlsz/status/1969650994192794103#m
>>
>>106670334
>wan2.5 is releasing mid 2026
lel, are you retarded ?
>>
Is the new qwen censored or am I good to download and edit boobs onto everything?
>>
>>106670426
artist? really cool
>>
File: WanimateCollage_00009.mp4 (686 KB, 812x896)
686 KB
686 KB MP4
>>106670421

Kijai example workflow.

https://github.com/kijai/ComfyUI-WanVideoWrapper/tree/main/example_workflows
>>
>>106670446
have a real deep think
>>
>>>106665054
is there a reason to resize the i2v source image before genning?
doesn't it get resized at the end into whichever frame size you choose anyway?
>>
>>106670435
If I open that link and it's the blue dragon I'm going to flip my shit.
>>
>>106670447
It's Nakayama Tooru
>>
>>106670455
Do you need a nasa pc to run it?
>>
>>106670465
>is there a reason to resize the i2v source image before genning?
well if you don't have much vram it's good to go for a small image, and it's slower if the image is big too
>>
File: le.png (1.25 MB, 1472x704)
1.25 MB
1.25 MB PNG
>>
>>106670435
who the fuck is that? why should i trust that person? how do they know the release date?
>>
>>106670458
Haven't touched image gen in a year or so, I have no idea if you're referencing something or not. Was just looking for a model to get back into it with and hoping to use something uncensored.
>>
>>106670472
thank you anon
>>
>>106670481
lurk more, he's a guy that gets invited by all the big AI companies in China, when he announces something, it always happens
>>
>>106670494
So if it's not out in 2 days, I'm free to call you a retard, correct?
>>
>>106670455
Probably doesn't recognize what to do with the anime eyes there but movement seems to translate better than 3D ones.
>>
>>106670501
good luck with that, he's right since he announced Wan 2.1
>>
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF/tree/main

models are popping up!
>>
>>106670478
fucking this
>>
>>106670550
>Q4_1
oh come on, why do they never start with the biggest quant :(
>>
>>106670550
whats the difference between qwen-image-edit-plus and Qwen-Image-Edit-2509
>>
File: 1731228389412961.png (108 KB, 2507x686)
108 KB
108 KB PNG
People will post the fakest posts with the biggest aplomb.
>>
>>106670563
what's the difference?
>>
>>106670598
Q8>Q4
>>
File: 1730860034950330.jpg (2.04 MB, 7961x2897)
2.04 MB
2.04 MB JPG
>>106670598
>what's the difference?
>>
File: file.png (867 KB, 2117x1076)
867 KB
867 KB PNG
how can I make the change stronger?
>>
>>106670597
no fucking way I just saw this and was about to post this here.
>>
>>106670597
fucking lmao.
the dumb cunt could have just asked how2faceswap but had to add a fake backstory.
>>
>>106670202
>>106670550
I don't want to be ungrateful but what's with this rollout... most people either use q4km or q8, why not start with those?
>>
>>106670616
is it the old or the new qwen image edit?
>>
>>106670627
old
>>
>>106670613
The milage this comparison's gotten is insane.
>>
>>106670593
bump
>>
>>106670632
try the new one then >>106670297
>>
So how do we give multiple references to the new Qwen edit plus in comfyui?
>>
>>106670617
It was such a retarded post.
>>
>>106670638
Need a new node or update to the existing one most likely.
>>
>>106670638
you go for that new TextEncodeQwenImageEditPlus node
https://github.com/comfyanonymous/ComfyUI/pull/9986
>>
>>106670613
Thanks.
>>
>>106670661
Hello sir, I make six figure and want to explore wifes bob and vageen on only fans. How to put wifes face on bob and vageen?
>>
File: ComfyUI_03086_.mp4 (1.31 MB, 720x1280)
1.31 MB
1.31 MB MP4
Did you know wan can do pullups? Maybe not that interesting.
>>
File: 1530495466412.jpg (33 KB, 600x564)
33 KB
33 KB JPG
>>106669952
>>106670189
downloaded all that bullshit
comfy just CLOSES what the fuck is this shit?

"got prompt
Using pytorch attention in VAE
Using pytorch attention in VAE
VAE load device: cuda:0, offload device: cpu, dtype: torch.bfloat16
Requested to load CLIPVisionModelProjection
loaded completely 98657.8 1208.09842069125 True
Using scaled fp8: fp8 matrix mult: False, scale input: False

C:\ComfyUI_windows_portable>pause
Press any key to continue . . ."

fag shit
>>
File: 1733026527108014.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
still the old model, waiting for the new one (q8 ideally)
>>
>>106670674
well since your such a python genius, why don't you figure it out by yourself?
>>
>windows
>portable install
LMAO
>>
>>106670685
I can do pullups too. Should I post a vid as well?
>>
>>106670685
Keeps the style consistent, that said it would be interesting if you prompted for a change in expression to see how it handles
>>
>>106665874
need this with little bit smaller hands touching her, those look like they belong to 8-foot giant that doesn't fit in that little space they are both in and is using noclip to fit anyway, kek
>>
File: 1747077910019438.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>106670691
>>
>>106669952
>>
>>106670700
HALP
>>
>>106670738
that no longer looks cool at all, scatfag
>>
File: artosis.png (371 KB, 628x528)
371 KB
371 KB PNG
>accidentally genned a man i am attracted to
>>
>>106670745
kek, he's right you know- we shouldn't make time for fag shit, when it could just WERK.
>>
I have never once had an issue with comfy UI since using my own install with my own environment. Portable is a trap and nightmare to maintain.
>>
>>106670745
absolutely uncivilized vermin
>>
>>106670751
>When you see another fucking Protoss go Nexus first on cross spawn
>>
>>106670745
This is how Furkgod got his start as Turkey's leading computer engineer
>>
>>106670478
whats the problem? They all work fine on my 5090.
>>
>>106670777
I found out furk blocked me the other day. I'm actually kind of sad.
>>
>>106670777
>Turkey's leading computer engineer
He cannot be stopped
>>
in a workflow, how do you know exactly which node follows another?
or does order not matter as long as all needed nodes are connected to each other?
>>
File: habbening.jpg (69 KB, 498x456)
69 KB
69 KB JPG
"stashing current changes
nothing to stash
creating backup branch: backup_branch_2025-09-22_17_27_57
checking out master branch
pulling."

what??????????????
>>
>>106670793
it means that it's updating and there's no errors so far
>>
>>106670793
Oh shit, RIP, it's over for you...
>>
>>106670793
Oh fuck, rip out your SSD now before it spreads.
>>
>>106670791

The freedom to connect shit also cause confusion. Anyway, just drag the nodes out and it will give you some options. Failing that, use search by double clicking empty field and search what fits.
>>
>>106670791
Are you new to this?
Just look at example workflows until it clicks.
Typically:
UNET > some optional model patching > sampler
Text encoder > positive and negative prompts > sampler
UNET and text encoder and can both be loaded independently or from a checkpoint
Then sampler will output processed latents
To decode that you load a vae (again either independently or from a checkpoint)
And that gets saved as image. (For videos, there might be more post processing like interpolation)
I don't feel like typing more detail but hopefully this helps.
>>
>press any key to continue
>press power button on my pc
>it turns off
i HATE comfy
>>
>>106665040
which nodeset is this from?
>>
>>106670857
Buttons aren't keys!
>>
>>106670860
nvm found it: https://github.com/BigStationW/ComfyUi-Scale-Image-to-Total-Pixels-Advanced

goated node
>>
File: ComfyUI_01190_.png (1012 KB, 1024x1024)
1012 KB
1012 KB PNG
>>
>>106670157
And porn?
>>
File: file.png (760 KB, 2632x1223)
760 KB
760 KB PNG
how to make it photorealistic saars pls hlp
>>
rentry wan guide
>never fucking worked or loaded
cool game
>>
>>106670923
>how to make it photorealistic
are you using the new qwen image edit model?
>>
>>106670916
Wan animate is basically useless for all but a narrow range of clips without excessive movement and lighting that won't make the character look like a bad green screen. It's a cool concept but in the end it's kind of grabo.
>>
>>106670935
yes
>>
>>106670355
Kijai should fork comfy and call it kijAI just for the lolz, 90% of the users would migrate
>>
File: ComfyUI_01191_.png (940 KB, 1024x1024)
940 KB
940 KB PNG
>>106670881
>>
>>106670939
just right, "make the image realistic"
>>
>>106670923
you can't add that many steps at once.
it is far better to do one prompt at a time, ie. place the char where you want and then do whatever else to them

you mong promptlet
>>
I don't understand Kijai's obsession with block swapping whatever the native comfy nodes use is so much more efficient
>>
>>106670857
>Press exe
>Send bitcoin to this address to get access to your computer
Comfy fuck man, this shit is the worst
>>
AI "slop" gets a bad name because of the inherent reddit clout-chasing nature of platforms like CivitAI and X.

All we see floating to the top of everyone's feeds is generic, safe, obviously-AI gens that get hundreds and thousands of upvotes.

But if you go to the model pages with sparse, recent submissions where it isn't a clusterfuck of people buying/farming clout and forcing their way to the top of the feeds with their bland vanilla mediocre gens, you'll see some amazing stuff. Real, actual art, usually with a controversial or violent streak and ZERO upvotes. I make sure to give them at least 1 and follow these obscure local genners
>>
File: file.png (731 KB, 2345x1231)
731 KB
731 KB PNG
>>106670955
what do you even mean??
>>
>>106670984
>denoise 0.88
wtf are you doing? don't touch that, put it back at 1
>>
>>106670984
please stop. it's evident you're just trolling now.
>>
>>106670984
>make the image a realistic
esl-kun
>fp8
>fp8
lol
>>
>>106670993
You've been saying everyone who has issues with the model is a troll. They can't all be trolls.
>>
>>106671000
to be fair there's not better than fp8 for the new qwen image edit right now
>>
>>106671000
my bad I rushed to get the result ready , shit takes forever
>>
>>106671001
what? this is the first time i've said anything to anyone using qwen.
the guy is very obviously fucking with us.
>>
>>106671006
fuck off
>>
Heads up, new qwen edit prefers left (1st image) and right (2nd image) over first and second image.
>>
File: file.png (704 KB, 2267x1286)
704 KB
704 KB PNG
it is even worse, maybe its something to do with the resolution of the initial img?
>>
>>106671011
stop fucking with the default settings and then being surprised why it doesn't work you stupid bastard.

>>106671013
you know this from testing or do they mention this somewhere
>>
>>106671024
try to remove the cfgnorm and the modelsamplingauraflox
>>
>>106671025
mm ok let me change everything back to default
>>
>>106671025
Testing and the fact the hugging face space examples explicitly use that language.
>>
>>106671024
I think you need to add this TextEncodeQwenImageEditPlus node for the new qwen model
https://github.com/comfyanonymous/ComfyUI/pull/9986
>>
File: 1755437112366107.webm (1.77 MB, 1920x1080)
1.77 MB
1.77 MB WEBM
What's the best way to extract this as pose for wan animate?
>>
File: ComfyUI_01994_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
File: file.png (740 KB, 2056x1233)
740 KB
740 KB PNG
still not there
>>106671060
how do I install that node?
>>
>>106671088
>update comfy
>double click on the empty space and search for "TextEncodeQwenImageEditPlus"
>>
File: worflow.jpg (266 KB, 717x1358)
266 KB
266 KB JPG
"When loading the graph, the following node types were not found.
This may also happen if your installed version is lower and that node type can’t be found.'

nothing in comfy is working
i updated but every workflow i grab just says im missing all this shitload of fuck
>>
Can anyone explain why loading a LORA at zero weight or not loading it all result in different images?
x should equal x + (y * 0), no?
>>
>>106671094
How do I do that???
>>
>>106671086
model?
>>
"Prompt execution failed
TypeError: Failed to fetch'
>>
>>106671122
Flux Krea
>>
Real retard hours
>>
>>106671088
>>106671105
you don't seem to need the new node (unless you want to go for multiple images), it seems to be working for your single image, and yes, the resultat isn't great, that's Qwen Image Edit, not Nano Banana lol
>>
>>106671139
stop trolling, anon.
>>
>>106671086
>>106671135
Hot damn I may have made a mistake in disregarding Krea. I just don't want to go back to flux.
>>
File: cloudfags.jpg (8 KB, 408x89)
8 KB
8 KB JPG
NON-LOCALFAGS ON SUICIDE WATCH
>>
Why does it always keep coming back to flux?
>>
>>106670550
Mine look deep fried
>>
>>106671176
I like it personally. Yes it is an overglorified LORA rather than a true novel checkpoint, yes I am a coping VRAMlet who can't into Qwen, but it can make comfy images.
>>
wan 2.1 = 31gb
wan 2.1 (distilled) = 18gb
i guess i have to retry tomorrow
>>
>>106671102
By weight I meant strength my bad.
In case it wasn't clear.
>>
File: 1747265336987381.mp4 (2.41 MB, 1008x480)
2.41 MB
2.41 MB MP4
>>106671079
it can't handle the spin jump
>>
File: Wanimate_00036.mp4 (1.36 MB, 832x480)
1.36 MB
1.36 MB MP4
>>
>>106670983
>inherent reddit clout-chasing nature of platforms like CivitAI and X
"Reddit" needs to be a noun here and not a gibberish adjective, sweetie. Clout-chasing predates civilization altogether.
>>
>>106671245
GEK!
>>
>>106671241
>>106671245
How?
>>
>>106671249
Ever heard of Cloutius Maximus?
>>
he's zapping to the extreme!
>>
>>106671095
I get this sometimes, I restart PC and it works again
>>
File: Wanimate_00064.mp4 (1.2 MB, 622x832)
1.2 MB
1.2 MB MP4
>>106671245
how do u make the output exactly the same length as input length?
>>
>>106671245
why stop there, just make a m2f transformation. Then we can all pluck out our eyes.
>>
File: Wanimate_000014.mp4 (1.62 MB, 1162x544)
1.62 MB
1.62 MB MP4
>>106671241
Pose strength 1.0
>>
can someone just post their 2.1 workflow? or link to one
>>
>>106671325
lmaooooooo
>>
>>106671325
KEK
>>
>>106671316
This is some high tech shit right here.
I would kill to have such technology back in 2000s, yet everyone is bitching about it. Go figure.
>>
>>106671327
https://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper
>>
Does the new qwen use the same text encoder as the last one?
>>
in kijai nodes, how can you set the lora strength per context window? say your first prompt shouldnt use the lora but the next one should.
>>
>>106670613
poorjeets itt will run at q4, say there's "no noticeable quality reduction", and then proceed to call the model plastic slop. anyone who runs a model quanted should not be allowed to critique the model
>>
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF/tree/main

I dont think it's working, or it might need a comfy/node update.
>>
>>106671458
>anyone who runs a model quanted should not be allowed to critique the model
true that
>>
>>106671464
The other guy posting quants doesn't work either, its probably that the gguf node needs updating
>>
>>106671458
Both Q4 and distill lora, double whammy quality hit. I think people forget how bad distill lora's make a model lol
>>
>>106670613
that chart is like 6 months old
>>
>>106671474
And? Quanting leaves a mark on every model.
>>
>>106671474
and? the quants are still the same
>>
saaaar is same quality and fasterrr use speed lora lightning saaaaar
>>
>>106671474
AND? that changes nothing.
>>
>>106670613
>>106671487
>>106671488
>>106671504
quants don't work like that. There's a reason you always see this example. because if you actually try to replicate the experiment you'll see that the differences are neglible
>>
File: shut the fuck up.png (1.22 MB, 1200x664)
1.22 MB
1.22 MB PNG
>>106671516
>>
>>106671516
>differences are neglible
Qwen text capability got fairly brutalized going to Q6 what I tested. Image quality was roughly the same.
>>
>there are unironic 3060 poorfags who think that q2 outputs the same as fp16
>>
i just hate black miku isall
>>
File: 1753503772374653.png (756 KB, 1269x895)
756 KB
756 KB PNG
>>106671516
>the differences are neglible
for the text encoder it's a bad idea to go under fp8
>>
>>106670613
>holding a smartphone on her left hand
>and a multicolored ball on her right hand
>she has a red t-shirt
>neons
Holy ESL
>>
>>106669930
AniStudio is actually really good. Been busy but I'll have new builds coming up in a week or so.
>>
File: 00015-1746511260.png (1 MB, 1344x768)
1 MB
1 MB PNG
>>
>>106671535
Not talking about the text encoder at all, I use T5 at FP16. DiT models can be taken to Q4 with very little differences
>>
>>106671558
>DiT models can be taken to Q4 with very little differences
This is bullshit, you have a counterexample right in front of your face and you continue to lie. VRAMlets are delusional.
>>
>>106671573
>you have a counterexample right in front of your face
Where?
>>
File: 1257567068.png (1.15 MB, 880x1184)
1.15 MB
1.15 MB PNG
>>
>>106671516
Larger models can cope with it better and some variants like nf4 and nunchaku can provide better quality than run of the mill Q4 but it is absolute cope to say.
This is probably (You) bait but whatever.
>>106671535
Honestly I wouldn't go below fp16 unless you are scraping the bottom of the barrel for gen speed up.
Text encoding is the shorter part of the gen process and the time gain/degradation ratio is really inefficient if you are doing multiple seeds with the same prompt.
>>
>>106671577
>Where?
>>106670613
>>
>>106671585
Like I said there's a reason that's the only example you see. Next time instead of relying on internet slander you should just run the experiment youself
>>
>>106671601
>Next time instead of relying on internet slander you should just run the experiment youself
I'm literally the guy who created that example lol
>>
>>106671604
Then put the images in a catbox and I'll run it for 10 seeds on 10 random prompts sourced from a 3rd party dataset.
>>
the reason he can't provide a counterexample is because q4 is the only way he can run the model in the first place KEK
>>
>>106671613
you do it, you're the one who claimed it was an exception and that the rule is low variance, you have the burden of proof
>>
>>106671079
Code/name?
>>
File: Wanimate_00070.mp4 (2.8 MB, 622x832)
2.8 MB
2.8 MB MP4
how the fuck do I set the output video length the same as input??
>>
>>106671621
First post your images so we can see that you didn't fake the experiment
>>
>>
>>106671631
here's the workflow for fp16 and Q4_0
>fp16
https://files.catbox.moe/xx2kvr.png

>Q4_0
https://files.catbox.moe/fn58gx.png
>>
>>106671628
It's processed in batches determined by the "frame_window_size" on the WanAnimate Video Embeds node. If there's a slight overrun it's going to fill in the remaining frames.
If you have a 150 frame video set it to 50 for 3 batches of 50 frames
If you set it to 70 it's going to still do 3 batches, the last 40 frames being random bullshit
>>
File: 2564636572.png (1.07 MB, 832x1248)
1.07 MB
1.07 MB PNG
>>
Maybe I'm just crazy, but the new edit model works better without CFG?
>>
this is by far the best version of chroma imo https://civitai.com/models/1956921/chroma-dc-2k?modelVersionId=2214897
>>
>>106671725
usually, when someone wants to shill his product, he usually adds an image to show the capability of the model, what you did is lazy lol
>>
>>106671458
I've always done Q4 and nobody says my gens are plastic
>>
File: Wanimate_00123.mp4 (2.42 MB, 930x1280)
2.42 MB
2.42 MB MP4
>>106671738
was just stating found it far far better than the HD one
>>
>>106671770
how much better? show an example?
>>
>>
>>106671777
She reminds me of Greta-anon...
>>
is there a way to see if sage attention is working while genning?
>>
>>106671741
yeah im not sure what anons on about plastic. quants fuck up the prompt comprehension but thats it visually
>>
>>106671774
69%
>>
>>106671770
Everything is better than the HD one.
>>
>>106671796
I was expecting 420%, lame
>>
>>106671725
>>106671770
If it is producing the iconic chroma deformed anatomy that is worse than SD 1.5 noticeably less often I can be interested.
>>
>>106671658
This one belongs to Anime Diffusion Thread: sorry for posting my image here.
>>
>>106671741
any workflow for a vramlet?
>>
>>
What do I prompt
>>
>>106671853
WW2 propaganda
>>
>>106671738
He links to a civitai page with images, like dude
>>
We've already been there
https://files.catbox.moe/qphnpf.jpg
>>
File: 1745369411195266.png (621 KB, 832x1248)
621 KB
621 KB PNG
>>
File: RA_NBCM_00010.jpg (1012 KB, 1872x2736)
1012 KB
1012 KB JPG
>>
>>106670613
can you add nunchaku to this
>>
File: 1739403724689009.png (3.61 MB, 1416x2120)
3.61 MB
3.61 MB PNG
>>
bros I was so fucking used to qie nunchaku... the wait will kill me
>>
File: 1729700304589481.png (1016 KB, 832x1248)
1016 KB
1016 KB PNG
>>
>>106671963
kino
>>
Never used nunchaku. does flux loras work with nunchaku model? I'd guess no they don't
>>
>>106671981
No loras work with nunchaku lol
>>
>>106671770
Don't want to be that anon but as far back as v50 people were posting on HF, here and on leddit that the model was behaving weirdly. It's much easier to make slopped looking images on it and it will randomly blur outputs for no apparent reason. It's why lode caved and made v48 the "Base" model when he realized the HD training didn't work correctly.
>>
>>106671622
MMR-AK090 Ami Sasano
>>
which model or whatever should I install if i just want a quick in and out transform image to landscape task?
>>
>>106671983
this makes flux pretty useless with nunchaku
might try qwen then
>>
>>106671983
>No loras work with nunchaku
wait seriously? bruh...
>>
File: 1731284587147981.png (1.16 MB, 1248x832)
1.16 MB
1.16 MB PNG
>>106671970
>>
>>106672028
Idk if Flux has support but qwen definitely doesn't.
>>
>>106671622
>>106671993
desu, I only care about BBW Javs. In fact. I need BBWs. My favorite is Reo Fujisawa.
>>
File: 1753504354557029.png (2.99 MB, 2120x1416)
2.99 MB
2.99 MB PNG
>>
File: nunchaku flux lora.png (79 KB, 742x394)
79 KB
79 KB PNG
>>106671981
>>106671983
>>106672028
>>106672042
>>106672034
Flux nunchaku quants support loras, without major slowdowns
Note that the quality varies though, especially when using more than one.
>>
5090 nigga here
anyone got a wan 2.2 t2v and i2v comfyui workflow optimized for my level of card? Also anyone got advice on what model and encoder I should use specifically? I feel like I usually see workflows optimized for cheaper cards
>>
>>
>>106672042
https://github.com/nunchaku-tech/ComfyUI-nunchaku?tab=readme-ov-file
Qwen Image is mentioned here.
I haven't installed this yet so my knowledge stops here.
>>
File: ComfyUI_00236_.png (1.35 MB, 832x1216)
1.35 MB
1.35 MB PNG
>>106671983
bullshit, I made this one earlier today while I was testing nunchaku
>>
>>106671837
I decided to give it a try but saw that it is fp16 only.
If you decide to shill this more later, please make a Q8 beforehand, thanks.
>>
>>106672108
Nigga
https://huggingface.co/silveroxides/Chroma-Misc-Models/tree/main
>>
what was the prompt to add two images together in qie plus? like the one from the twitter post that i can't find right now
>>
File: Wanimate_00075.mp4 (3.73 MB, 740x1024)
3.73 MB
3.73 MB MP4
her boobs got smaller
>>
https://xcancel.com/ImperfectEngel/status/1970330695047561339#m
when you compare the 2 images, there's this fucking yellow tint, I thought they stopped training on 4o, c'mon chinks...
>>
>>106672125
Well how would I know, it wasn't linked there.
Alright thanks anyway.
Might make a post about what I think of it later.
>>
>>106672125
Well you might be in the know.
Do you know what's up with the "T2-SL4" variant?
>>
Less than 24h for wan 2.5, my dick is hard and ready, please infinite gens or at least 10sec+
>>
>>106672140
user error. Put balloons under your shirt and get to makin content bucko. The future of porn depends on fat sweaty old men shakin their butts covered in ridiculous prosthetics.
>>
>>106672175
>Less than 24h for wan 2.5
um source?
>>
Lmao should have backed up my Comfyui before updating everything. WanImageToVideo node maxes out at 24GB VRAM and either takes 5 minutes or hangs indefinitely now.
>>
>>106672200
A tweet
>>
>>106672200
https://www.timeanddate.com/worldclock/china/beijing
>>
File: file.png (947 KB, 1267x712)
947 KB
947 KB PNG
still have to generate the pose/depth controlnet ourselves right? not sure i follow what they are saying.
>>
File: Wanimate_00077.mp4 (2.53 MB, 1258x576)
2.53 MB
2.53 MB MP4
>>
>>106672240
>>106672240
>>106672240
>>106672240
>>
>>106672175
>please infinite gens or at least 10sec+
it'll be like Qwen Image Edit Plus, a small improvement, not a lot of time passed between Qwen 2.2 and 2.5
>>
>>106672243
>a small improvement
Nope, confirmed tons of important improvements if anything I have a suspicion that it may be closed/api only
>>
>>106672243
Wan 2.1 to 2.2 was a big improvement
>>
>>106672250
>confirmed tons of important improvements
name literally any
>>
>>106672254
>Wan 2.1 to 2.2 was a big improvement
9 months separate wan 2.1 and wan 2.2 though
>>
>>106672215
>>106672216
i was cereal
>>
>>106672337
https://xcancel.com/bdsqlsz/status/1969650994192794103#m
>>
>>106672344
I want to believe but "new open-source video model" doesn't necessarily mean wan 2.5
>>
>>106671842
Could do one, sure. I'll post the reply in the next thread. I'm on 12gb so if you're at 8 or less you can go with a smaller gguf or smaller text encoder, or even smaller gen size
>>
>>106672344
probably isnt want 2.5 if its "new" open source model, 2.5 would be like "updated" open source model
or im just overthinking it and ESL just wants to make it ez
>>
>>106672368
>>106672443
The bdsqlsz did go to a conference for wan 2.2 before it released and there was a discord screenshot that a gatekeeping dickhead eventually posted on here about 2.5 like 4 threads ago. Either way, we'll know for definite if it is or not soon
>>
>>106672518
>gatekeeping dickhead eventually posted on here about 2.5 like 4 threads ago
wow i missed that. cool. well those are some decent potential hints.
>>
File: AnimateDiff_00001.mp4 (1.16 MB, 512x480)
1.16 MB
1.16 MB MP4
I'm starting to wonder if the fact that the image source being AI made is fucking with the FFLF i2v getting so much colorshift.
But the shift also only happens when it is FFLF, if just First frame there's no colorshift.
With or without the Color Match node, the shift still happens.

For these loops I make with old reactionimages, the shift doesn't happen.
>>
I'm on 24gb vram and 32gb ram and when I gen, my ram usage is at 99% most of the time
Would I get a sizeable speed increase by upgrading to ram to 64?
>>
>>106673068
If you're running WAN 2.2, then yeah. It stores models in DRAM when they're not loaded into VRAM, so you're almost certainly paging each time the high or low ksampler starts.
>>
>>106673068
I'm in the same boat and my memory sits at about 55GB. Go a lot bigger if you can.
>>
>>106673175
yeah, definitely paging hard
>>106673183
higher than 55, or you mean higher than even 64?
>>
>>106673231
Higher than 64GB (which is what I currently have). Go for 96GB, 128GB or 192GB (or even 256GB if you're a baller) instead. 64GB is just the new minimum and will be more of a side-grade for you.
>>
File: ComfyUI_06728_.png (1.34 MB, 1248x1472)
1.34 MB
1.34 MB PNG
I am in Japan now. most anistudio work while I'm here will just be cmake and splitting things off into shared libs. sorry I haven't been active on the repo recently but I'll be back at it. wish me luck with softbank fundraising!
>>
>Somebody else here found that you need to update your ComfyUI and replace your text encode nodes with TextEncodeQwenImageEditPlus. I'm testing it and it seems to be working.

for edit v2
>>
File: 1727971482734294.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>106673385
the anime girl is waving hello.

works, unless you change the node to the new one it will be random noise. also, the node has 3 image inputs so it should be easy to do multi input stuff.

with: Qwen-Image-Edit-2509_fp8_e4m3fn.safetensors
>>
File: 1732423733969932.png (789 KB, 1024x1024)
789 KB
789 KB PNG
>>106673415
test 2: connect another load image node to image 2

the teal hair anime girl is shaking hands with the pink hair anime girl.

amazing, no more image stitch bullshit or concatenate jank, it just works with an image node.
>>
File: 1741950085997571.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>106673424
the teal hair anime girl is sitting at a table in a coffee shop with the pink hair anime girl.
>>
File: 1741858370591127.png (1.05 MB, 880x1176)
1.05 MB
1.05 MB PNG
>>106673430
the two japanese women are standing in an empty classroom in Japan.

source images: anri and anri (diff pic)
>>
File: 1754025836118363.png (1.16 MB, 880x1176)
1.16 MB
1.16 MB PNG
>>106673441
the japanese woman is standing in a japanese hot spring wearing a white bikini.

for a cropped photo it did really good desu
>>
>>106673451
and yes, qwen-image-edit-remove_clothes.safetensors still works if you want to do that.

https://files.catbox.moe/y5y946.png
>>
Wan 2.5
>https://x.com/Alibaba_Wan/status/1970405877778915523

inb4 x.com, just type xcancel
>>
>>106673776
nigga you DID type it and then you deleted it for some retarded reason



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.