[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1745985557520.jpg (1.57 MB, 2303x2854)
1.57 MB
1.57 MB JPG
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev:>>106613605

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI:https://github.com/comfyanonymous/ComfyUI
SwarmUI:https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo:https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next:https://github.com/vladmandic/sdnext
Wan2GP:https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training:https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond:https://rentry.org/comfyui_guide_1girl
Tag Explorer:https://tagexplorer.github.io/

>Misc
Local Model Meta:https://rentry.org/localmodelsmeta
Share Metadata:https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks:https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt:https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin:https://github.com/Acly/krita-ai-diffusion
Archive:https://rentry.org/sdg-link
Bakery:https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106618946
Due to unforeseen circumstances there were multiple good video gens that were left out of the collage. My apologies, anon.
>>
they should rename nunchaku to nonechaku
>>
>>106618968
it's already dubbed copechaku
>>
Blessed thread of frenship
>>
>>106618967
you can stick your sorries in a sack, mister
>>
where are all the miraculous snake oil promising +50% speedup and -50% vram usage?
they used to be a weekly thing, but now nothing happened for weeks
>>
tfw you find a neat artist to train on
>>
File: 1_00027_.png (581 KB, 1024x1024)
581 KB
581 KB PNG
>>
>>106619040
people finally caught on
>>
File: 1_00039_.png (855 KB, 1024x1024)
855 KB
855 KB PNG
>>
>>106619040
yeah, I kinda miss that, they were snake oil but you didn't know it first and it was a good way to pass some time testing them out
>>
File: 1_00034_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
oh it's a magazine
>>
File: 1_00046_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>
>>106619040
Well light lora did make 50min video gens take 5 minutes while preserving 95+% of the quality and 80+% of motion if not more today, especially if you really need something complex and create a more advanced workflow that doesnt use the lora for the first couple of steps

Copechaku also does wonders for what you can do in 8-12gb vram although not worth for anyone who can run q6-q8
>>
File: wan.jpg (39 KB, 658x657)
39 KB
39 KB JPG
>>106619040
>Wan nunchaku/Radial Attention (perpetual distracted by shiny new image models)
>Jenga
>Taylor seer
>DualParal
>DCM

These were all for 2.1 by the way. 2.3 or whatever they decide to number it will release before any updates are made to the above list. 100%
>>
File: 1757517359420468.gif (2.8 MB, 278x498)
2.8 MB
2.8 MB GIF
What 3D generator can I use for free?
>>
kris...
>>
File: 1_00055_.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>
File: 1_00012_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
>>106619131
too soon
>>
File: 1_00057_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>
>>106619100
why post this garbage man? where the fuck is she sitting? her hands are not even on the wheel brah
>>
You can make your comfy more comfy (still, not too comfy)
https://github.com/comfy-deploy/comfydeploy
>Full environment control - Install any custom node and models, and have it immediately availble for your team.
>Sharable Workflows - If it works for you, it will work for them — environment guaranteed
>Workflow Versioning - Safely iterate with complete version history and one-click rollback
>>
File: 1740475601974806.png (1.28 MB, 1008x1008)
1.28 MB
1.28 MB PNG
>>106619152
>>
File: 1736189202353320.jpg (1.61 MB, 2016x1152)
1.61 MB
1.61 MB JPG
>>
>>106619100
>>106619182
She looks vaguely asian so she is /ldg/ approved. You may continue,
>>
File: 1_00065_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>106619208
>>
File: 1749693077860262.png (1.26 MB, 1344x672)
1.26 MB
1.26 MB PNG
>>106619210
>>
>>106619252
Can you have him airdrop giant rocks on a chibi Ani's head?
>>
File: 00_00060_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
File: ComfyUI_00349_.png (1.84 MB, 1328x800)
1.84 MB
1.84 MB PNG
is this enough to fool youtube's age verification system?
i doubt they do any kind of actual checks especially for third world ids
>>
>>106619311
this looks like shit and won’t pass as an estonian id lol
>>
nunchaku qwen lora?
>>
File: comfy51.jpg (2.65 MB, 2000x2342)
2.65 MB
2.65 MB JPG
>>
is Ani here?
n*gbo has a question >>106619256
>>
>>106619311
miku looking good for a 45 year old man
>>
>>106619321
is debo retarded? ggml invented gguf
>>
File: ComfyUI_00323_.png (1.63 MB, 1248x832)
1.63 MB
1.63 MB PNG
>>106619318
>>106619327
heh
>>
File: 1730508088301730.png (2.2 MB, 1328x1328)
2.2 MB
2.2 MB PNG
>>106619268
>>
THREAD CHALLENGE:
- Generate a 'strong' colorful-dyed-short-hair cyberpunk brown lesbian that you don't immediately hate.

(I haven't been successful yet)
>>
File: ComfyUI_00354_.png (1.54 MB, 1328x800)
1.54 MB
1.54 MB PNG
tintsune
>>
>>106619395
>lesbian
i will hate her by default
>>
>>106619395
>short-hair
>lesbian
well yeah, lesbians with long hair don't exist, I'm being serious
>>
File: 00_00071_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
>>106619424
lesbian is just a visual style in this context

It's amazing how unsexy big tits are on the wrong person. This 'look' is a black hole into which all sexuality disappears...
>>
>>106619495
>cunny issue
>hag on the cover
rope
>>
>>106619502
She looks pretty good for being a 7000 year old demon.
>>
File: 1742071547845048.png (2.47 MB, 1328x1328)
2.47 MB
2.47 MB PNG
>>106619501
slopped
>>
Do I need to use the sage attention node in a work flow or is just using the --use-sage-attention argument good enough?
>>
File: 1752540283203023.jpg (1.95 MB, 2016x1152)
1.95 MB
1.95 MB JPG
>>
>>106619541
dont use the argument, just use the node. from my understanding native nodes to do that are coming soon, for now use kijai's (bad) patching nodes, not you need to actually set them to disabled instead of bypassing them to restore normal operations
>>
File: 00_00077_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>106619395
>>
>>106619560
what kind of shitty model is this
>>
File: ComfyUI_00365_.png (1.77 MB, 1328x800)
1.77 MB
1.77 MB PNG
>>
>>106619557
Thanks. I was just wondering cause I only really do video gen on comfy and never disable sage.
>>
Still trying to figure out how to do this successfully...
>>
>>106619652
>512x768
>>
>>106619568
Seedream 4.0
>>
>>106619711
Yes, I chose that resolution on purpose.
>>
>>106619722
And what is the purpose? What dog model is that anyway?
>>
>>106619731
Nevermind didnt read the filename
>>
File: 00397-672133736.jpg (642 KB, 2048x2480)
642 KB
642 KB JPG
You have no fucking idea how much Chroma pisses me off and how little anyone knows about training these loras
>>
>>106619731
>he can check file dimensions but can't check filename
>>
>>106619747
>60 steps
I want to gen in 8 seconds max thoughbeit
>>
>>106619748
The image autoexpands on the screen where I see it's small.
Anyway drop the positive prompt for your fag butch lesbians so I can gen you something with non-retard settings.
>>
>>106619762
So do I'm tempted to go back to noob. The model feels incomplete and the things I feared came to pass. I would like for him to improve the lighting model and I would make loras for that.
>>
>>106619711
>>106619731
>tfw genning 640 x 480

And there is not a thing you can do about it
>>
>>106619801
I can laugh, lmao.
>>
File: 1737434212017023.mp4 (2.87 MB, 1024x768)
2.87 MB
2.87 MB MP4
the JIGGLE lora is the best phat ass lora for wan
but it makes them have tattoos too often :/
>>
>>106619770
Looks good on my 110ppi monitor. Feel free to ctrl+scroll wheel to get a closer look

but I'll humour your request for prompt:
>blurry digital photo of supermodel Silvia Zapata's subtle cleavage at a cyberpunk expo in Buenos Aires, october 2015. She just turned 19 and she's cute as fuck, a tall dark latina with an edgy short haircut dyed in aquamarine and eyebrow piercings. Her deceptively simple iridescent silk qipao with loose transparent pvc sleeves is so fucking cool and her minimalist avant-garde makeup is next level. She has a braided nylon sash with a holstered gun in a 3d-printed holster. She has a slim girlish figure with a tiny waist. There's a dark olive heavy velvet curtain behind her.
>>
File: nomorepls.webm (1.09 MB, 432x504)
1.09 MB
1.09 MB WEBM
>>106619320
>>
Seedream is insanely good. China won the API war
>>
>>106619949
Wrong race of the guy
>>
File: WanVideo2_2_I2V_00430.webm (1.98 MB, 1024x1024)
1.98 MB
1.98 MB WEBM
>>
>>106619747
>You have no fucking idea how much Chroma pisses me off
As a Chroma anti, I do.
>>
>>106619974
>I2V
real image or did you gen it too
>>
>>
File: 1741808823146885.png (2.31 MB, 2304x2304)
2.31 MB
2.31 MB PNG
>>106619395
>photograph of a 'strong' colorful-dyed-short-hair cyberpunk brown lesbian
>>
>>106619987
It was
>>106619005
>>
File: 00_00090_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>106619920
Still can't gen without the slow mo?
>>
In case the next op (understandably) hates brown cyberpunk dykes, here's another chesty teacher to improve my chances of getting in
>>
File: 00_00093_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
File: 1754257685300285.mp4 (3.15 MB, 1024x768)
3.15 MB
3.15 MB MP4
>>106620027
sorry i was catching up on the previous thread and posted my big dumper nigresses and then replied to that before i went through this thread

he looks almost like tony soprano. even the "ugly" men imagegen models make are photogenic. hopefully one day AI learns how to make diverse ranges of ugly bastards, but probably not with the transformers architecture

>>106620053
>Still can't gen without the slow mo?
actually, I haven't tried out some of the things anons itt suggest, like turning off the 2.2 lora on high noise only. i'll be able to dedicate more time next week i've basically just been queueing up a few prompts at a time and trying out recent loras ive downloaded

i've been completely gaslit and reverse-gaslit about high and low models. sometimes I want huge cleavage in bikinis, so i only run my big tits lora on high and not low because running it on low makes the nipples show or sometimes fuses with the fabric which is not ideal. only running it on high helps a lot but sometimes the booba isnt big enough. i should probably move on to the m4crom4stia lora or whatever too since i'm still on the wan 2.1 big breasts lora

but now with lightning i hear that you shouldnt run certain things on high. guess i need to do my own research
>>
File: 00_00096_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>
>>106620059
prompt?
>>
>>106619920
can you sharelu yourr catboox gallery righ now? not motivated to check the digusting b
>>
>>106620133
don't have one to share right now
even if i did i have 3 days worth of backlog so i would upload that first
>>
>>106620114
Apologies for the pornographic language:
>blurry digital photo of a very young teacher's cleavage in a cute cardigan over a maxi dress. She's really pretty. At the front of a public school classroom with a blackboard, in Charlottetown PEI, 2012. She's 26 years old and new to teaching. Her sloppy boobs make her look downright fuckable. She's looking at the class and doing a lesson, leaning in slightly.
>>
File: 00_00104_.png (468 KB, 1024x1024)
468 KB
468 KB PNG
>>
>>106620159
thx
>>
blurjeeta? my beloved?
>>
File: 00_00107_.png (938 KB, 1024x1024)
938 KB
938 KB PNG
>>106620159
>>
File: 1751888466633408.jpg (17 KB, 360x360)
17 KB
17 KB JPG
>>106619920
zamn did ya hear what dat lil ytboi just called us?
>>
>>106620174
also the cfg, although I think these values are a little bit resolution-dependent, was 2.2, and the sampler was euler A with 0.38 eta noise, 1.01 sigma noise, and 40 steps sgm uniform.

>>106620187
Yes, hello.
>>
>>106620059
where were these broads when i was in schooling
>>
File: 00_00117_.png (633 KB, 1024x1024)
633 KB
633 KB PNG
>>
>>106620230
>where were these broads when i was in schooling
you didnt go through puberty yet so you never noticed in elementary school
or you're a mutt and everyone was fat
>>
File: 00_00128_.png (830 KB, 1024x1024)
830 KB
830 KB PNG
>>
File: 00_00148_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>
wow i posted a single video and the entire /b/ thread got nuked literally 1984 do your jobs you lazy faggot mods
>>
>>106620159
this is a pretty unique prompt style. it's like casual boomerprompting. have you found more success with this prompt style compared to other types of prose? feels like it has a large chance of just wasting tokens or getting schizo results
>>
File: WanVideo2_2_I2V_00426.webm (694 KB, 768x1056)
694 KB
694 KB WEBM
>>
>>106620449
I don't boomerprompt the way I used to. You have to adapt to the model. Here it's a combination of some lightly boomerish prompting-by-feel but mostly a solid base of autism autocaption prompting. "[type of image] of a [subject]'s [main focus] [prominent specific details] . . . [single prominent background object] . . . [what subject is doing, pose]"

You can't really avoid doing that with Chroma, it's obviously not like SD1.5. But you can get away with a little experimenting in there as long as you've got the important bits in place, and you don't let the prompt get too long.
>>
File: 00_00162_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>
I hate realizing these VLM captions aren't 100% accurate but I'd rather drag my nuts through glass than check even a mere 50 captions.
>>
>>106620528
its ok, the base model's training data's captions aren't 100% accurate either

our perception of reality isn't 100% accurate either. if it were we wouldnt have arguments about what "realism" is or looks like
>>
>>106620509
>>106620449
also a relevant tidbit of /sdg/ history: the term "boomerprompting" was actually coined by another anon (explicitly he presented it as a new concept with a theory in a post) to describe my prompts a few years ago, which were for a week or so a subject of discussion (this was back when by default the prompt was the filename, and mine were pretty distinctively different which prompted others to experiment with it). I disagreed a little then and still do with the idea, which doesn't even remotely describe how or why I prompt the way I do, but still to this day it gets identified with the style of my prompts, which haven't completely changed.
>>
File: RA_NBCM_00002.jpg (1.13 MB, 1872x2736)
1.13 MB
1.13 MB JPG
>>
>>106620556
sometimes i wonder about the "hidden history" that never gets written down.

like, no history book will record that as a result of gay people in video games and celebrities being sluts with their iphones, 4chan's userbase doubled and donald trump won his first term as a result

i even noticed this with the term "slop". slop came from here which came from goyslop obviously, but normies quickly rejected that origin story not even because they suck Israel's penis or something obviously but just because they didn't want to accept that history as true
>>
>>106620556
I remember you, with those "my cousing debbie looked so good in that dress, she just turned 19" something like that, really good
>>
I was ready to give up on Chroma but it turns out using res_2s and beta57 actually makes it kinda usable... who would have thought
>>
>>106620601
i stopped being shocked by the mass adoption of underground internet culture around that time nicki minaj used pepe on instagram
>>
>>106620586
nice
>>
File: 00_00168_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>106620626
>"my cousing debbie looked so good in that dress, she just turned 19"
Blast from the past.
>>
comfyUI is leaking memory. fix it
>>
>>106620640
>i stopped being shocked by the mass adoption of underground internet culture around that time nicki minaj used pepe on instagram
one of my gym bro friends finally used "mogged" for the first time IRL and I was instantly transported back to my 2018 /fit/ days
4chan slang that's more than 1 level deep takes on average 3-7 years to get mainstream i've noticed
>>
File: 00_00177_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>106620658
Works on my machine
>>
>>106620556
>and [my prompts] were pretty distinctively different which prompted others
a prompter's prompts prompting prompters to prompt prompts

>>106620601
>normies quickly rejected that origin story not even because they suck Israel's penis or something
they do though actually.

>>106620691
she cute
>>
>>106620699
>they do though actually.
bro fucking /pol/ of all places likes Israel more than the average person in the US or Europe under 30 what the fuck are you talking about right now
>>
>>106620658
Remove the --fast cope
>>
>>106620752
The cope part comes when people say it's good without comparing the quality vs speed with and without
>>
>>106620710
oy, nebekh! what a terrible shoah is happening in Palestine, I heard it on The Adam Friedland Show
>>
File: ComfyUI_00218_.png (2.98 MB, 1280x1920)
2.98 MB
2.98 MB PNG
>>
File: RA_NBCM_00005.jpg (843 KB, 1872x2736)
843 KB
843 KB JPG
>>
>>106620759
>comparing the quality vs speed with and without
i agree if you use fp16 models, but with GGUF and a fp16 text encoder i shave like 30 seconds off my offloading with neglible loss in prompt understanding (definitely less than using a fp8 scaled text encoder) and no effects on video quality/coherence
>>
>>
File: 00_00188_.png (976 KB, 1024x1024)
976 KB
976 KB PNG
>>
File: 1727369368524197.png (15 KB, 894x773)
15 KB
15 KB PNG
>>106620714
I'm not using that doe
>>
>>106620845
then use --cache-none to turn off their automatic memory management

if that doesn't fix it, you've got bigger problems
>>
File: crusade.jpg (931 KB, 2712x2072)
931 KB
931 KB JPG
>>
File: RA_NBCM_00008.jpg (667 KB, 1872x2736)
667 KB
667 KB JPG
>>
Already sped up my workflow 5x thanks to api nodes. Delivering results to clients even faster now. Cloud compute will be necessary going forward if you want to remain competitive
>>
>>106620986
does it auto censor things
>>
>he's still at it
The madman
>>
>>106621008
Depends on the model you pick. western shit like google and bfl might, but seedream has no censorship. It’s all about knowing which model is right for the task
>>
File: kek.png (1.15 MB, 1120x1120)
1.15 MB
1.15 MB PNG
>>106620986
id rather save up for a 6000 pro
>>
File: 1740695169580788.png (1.03 MB, 832x1248)
1.03 MB
1.03 MB PNG
>>
>>106621038
Except it goes out of date while cloud compute remains the same, or even gets cheaper over time. Just like all those 8x3090 llm rigs people built 2 years ago: already abysmally slow for any modern inference task
>>
>>106620986
True competitive advantage comes from the deepest recesses of my twisted creative mind
>>
File: 00043-2941639059.png (999 KB, 824x734)
999 KB
999 KB PNG
>>
>>106621057
Anon, I'm just making images at home to amuse myself. You think I'm trying to run local LLMs
>>
File: 1746365371336300.png (2.87 MB, 1416x2120)
2.87 MB
2.87 MB PNG
>>
>>106618946
API here API there, cloud here, cloud there, obvious samefag shilling comfyui, welp, into the filter ldg goes now alongside sdg, it was good while it lasted.
>>
File: 00051-2082299397.png (982 KB, 1024x1024)
982 KB
982 KB PNG
>>
>remain competitive
I just want to coom
>>
>All it takes to upend this general is a couple of obviously satirical remarks about API nodes now and again.
>>
File: 1754698454329475.png (2.06 MB, 1120x1440)
2.06 MB
2.06 MB PNG
>>
File: 00_00200_.png (809 KB, 1024x1024)
809 KB
809 KB PNG
>>
File: 1755958830550827.mp4 (3 MB, 720x912)
3 MB
3 MB MP4
hatsune miku runs on the beach

neat
>>
File: 00_00209_.png (1.49 MB, 1024x1024)
1.49 MB
1.49 MB PNG
>>
>>106621275
she looks a bit different, are you sure that's all you prompted?
>>
is there a decent way to get facial consistency across short clips without using a lora?
im doing like 10-15 second videos is all i really need right now, training a lora for wan 2.2 i2v seems like a hassle, surely theres a better way to keep the faces the same for the duration of the video?
>>
File: WanVideo2_2_I2V_00431.webm (1.07 MB, 1024x1024)
1.07 MB
1.07 MB WEBM
>>106621072
>>
>now and again.
>>
File: 00_00214_.png (996 KB, 1024x1024)
996 KB
996 KB PNG
>>
File: 00022-3329399466.png (2.69 MB, 1248x1824)
2.69 MB
2.69 MB PNG
final figured out how to get forge couple region prompter to work. Time for some mommy hugs :)
>>
File: WanVideo2_2_I2V_00432.webm (899 KB, 768x1312)
899 KB
899 KB WEBM
>>
File: 00025-3202785678.png (2.16 MB, 1248x1824)
2.16 MB
2.16 MB PNG
>>
File: ComfyUI_07095_.png (1.81 MB, 1152x1152)
1.81 MB
1.81 MB PNG
>>
File: ComfyUI_07098_.png (2.38 MB, 1152x1152)
2.38 MB
2.38 MB PNG
>>
>>106621654
Disgusting porn
>>106621659
Cute
>>
>>106619040
SRPO just came out bro. Now it just needs to be applied on relevant models and not flux.
>>
File: 00033-2468380678.png (2.42 MB, 1344x1728)
2.42 MB
2.42 MB PNG
>>
I just want you to know your fetish is illegal and immoral.
>>
I just want you to know your fetish is based and supported.
>>
File: output-00011.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
File: 00007-3160605783.jpg (479 KB, 1344x1728)
479 KB
479 KB JPG
>>
>>
File: output-00016.png (894 KB, 832x1216)
894 KB
894 KB PNG
>>
>>
File: output-00020.png (1.28 MB, 832x1216)
1.28 MB
1.28 MB PNG
LLL
>>
File: output-00026.png (716 KB, 832x1216)
716 KB
716 KB PNG
upscaling doesnt seem to work or rather it doesnt seem to sample after, also no latent upscale but its not that bad
>>
File: ComfyUI_02573_.mp4 (2.26 MB, 896x896)
2.26 MB
2.26 MB MP4
>>106621659
>>
what's the recommended way to upscale? just x2 the latent, denoise 0.5 and use the same conditioning and model?
Or should I use a dedicated upscaling model (potentially faster but shittier?)
>>
>>106621654
nice view
>>
>>106621980
I use UltimateSD. If you see tiling or artifacts lower cfg
>>
>>106621980
latent with cnet is unmatched
>>
>>106622036
Tile to be specific. Shit is magic.
>>
>>106621654
Great :D
>>
>>106622036
>>106622039
+ promax model
>>
>>106622264
>>106622036
I was looking into this, I see that there's the unionx pro tile cnet for flux dev, for xinsir's any sdxl checkpoint would work I guess?
>>
>>106622313
Should be yeah
>>
Just ask seedream to make it bigger
>>
>>106622345
Can I just make a silent prayer or do I need to voice my wish out loud?
>>
>>106622036
workflow?
>>
any guide on how to produce fp8 scaled quants?
>>
>>106622566
You can scale in unet model loader
>>
>>106622627
I don't think that's what he was asking.
>>
>>106622638
He was asking for trouble and got it
>>
File: 1745460950266650.png (119 KB, 1113x626)
119 KB
119 KB PNG
>>106622638
>>106622655
yeah well I just vibecoded it (tm), hopefully it works lmao
>>
File: 0001.png (1.66 MB, 1152x1440)
1.66 MB
1.66 MB PNG
oo bby
>>
>>106622681
>I want to use a worse model version
why tho
>>
>>106622698
Hercules looking a little worse for wear these days.
>>
No one here use SDXL anymore?
>>
>>106621584
>>106621736
mommyyyyyy

can you do Queen ,arika like this?
>>
>>106622745
ffs, I meant Queen Marika
>>
Is there a local version of gemini's nano banana?
>>
>>106622766
qwen edit
>>
>>106622702
because im a vramlet??? hello???????
>>
>Use a character lora I found on civitai
>Get mutants when upscaling
>Look up metadata
>Trained at 768x768
Please, if you are using an illustrious finetune model, train at 2048x2048 on illu 2.0 like any normal human being in 2025, else your lora won't survive upscaling
Who still train loras for SDXL at sub 1024 in the fucking current year
>>
>>106622782
>what is q8
>>
>>106622866
but goofs are scary to make...
>>
>>106622878
very squishy beans
>>
If I want Qwen image to understand more concepts do I use a better quant of Qwen-VL or of Qwen image?
>>
no you train a lora for it
>>
>>106622766
yeah just use api nodes
>>
>>106619557
Kijai basically use Python introspection to change the regular comfyui attention functions into Sage Attention. It's a tremendous hack, everyone agree it's a hack, but it also works and has worked for close to 6 months now. It's a "temporary hack solution" which has been going for far too long and works far too well.

It restores your original attention functions after a single run, so you don't even need to do anything after you use it. KJ (hack) sage attention nodes just works (though the node is disabled by default, you need to manually select you want to use it in the dropdown menu, which is arguably retarded).

In any case, KJ attention nodes works ...By patching in real time your runtime functions, and unpatching it. But it fucking does work, has been working for what, one year? If you select yes in the drop down menu about "ARE YOU REALLY SURE ABOUT using it", of course.
>>
>>106623000
I never said it doesn't work, but doing that sort of monkey patching is generally bad from a pure programming and maintanability pov... short of reworking how comfy works under the hood (which it did: https://github.com/comfyanonymous/ComfyUI/pull/9639 hence why a lot of custom nodes broke). I'm not sure if it's exposed in the UI (via node or setting, didnt honestly explore this)
>>
>>106622908
Prompt comprehension is always tied to the clip/text encoder, it's the model that is used to control the Diffusion model in order to understand your prompt.

Getting better image quality is absolutely trivial. You can unironically train any diffusion model to make any style, and training absolute photorealism is 90 hours of GPU time. Or training it in Van Gogh. Or your favorite waifu. All of that is not hard.

Training better prompt comprehension is absolutely hard, and that's because it's all in the text encoder/fucking clip-l/t5. People tried, but mostly failed. I mean people tried hard. We've got people who mapped every single output of the clip-l, every single bit added of context by the t5, retrained models-hard.

To answer your question, if you want better prompt comprehension (but not better image quality) use a better Qwen-VL quant. Though there is a limit. Diffusion models are trained together downline of their text encoders, you can't simply change a text encoder with the other, and you can't simply take a better text encoding model and hope it will work with your diffusion model. I say this because people are trying to use the Delitized (obliterated) Qwen-VL for Qwen Image, and as it happens, Qwen Image has never been trained for or with the Delitized Qwen-Vl. So it doesn't change anything.
>>
>>106622878
they aren't on huggingface?
>>
>>106623070
not for all models sadly, hence why I was asking
DUH
>>
>>106623014
If monkey patching is the only way to fix your fucking software, and it has been going for an entire year, and it does work, and no one gives a fuck about fucking patching your software in order to get this trivial function __natively__?

Then there is something seriously wrong with your software, software model development, there is something tremendously wrong with something happening under the hood. Point is, KJ sage attention monkey patching just work, and has just worked for one year now. Comfyui cannot in their right mind muster the willingness to do it right. Protip: who is in the wrong isn't KJ.
>>
>>106623030
Thank you anon
>>
>>106623115
idk why youre getting all worked up over this, I've just said that doing it that way is a hack, I've been using the node myself.
Regarding 'b-but the base software is shit, they didnt have the foresight to add the attention mechanisms earlier!' is a really dumb take, shows that you're a no-coder. Assumptions around architecture are ever moving. The fault isn't with the codebase itself, but with the comfy devs taking their sweet time to implement it. If you check the PR you'll see that it was mostly busywork as they had to change each model to properly forward the transformer_blocks param, so it could be changed at any point (and a couple function to register flash/sage attn).
are you kijai perchance? lol
>>
>>106623164
>are you kijai perchance? lol
Nta, why do people always do this when someone is too passionate/hateful/enthusiastic about a topic?
>>
>>106622841
idk, i just use the onetrainer preset for sdxl because getting information or asking for help always gets ignored or scoffed at because everyone assumes you should know everything already
>>
>>106623082
>not for all models sadly
What mememodel are you using that isn't on HF?
>>
File: 1748724386721647.png (27 KB, 412x321)
27 KB
27 KB PNG
>>106623178
I mean, did you see how he replied to me?
>anon asks --should I use -sage-attn--
>I say no, just use the node, but its kinda bad dont forget to set it to off if you want to stop using sage attention, pic related
>other anon chimes in saying it's a hack, but it works
>I say yes it works, yes it's a hack, but it's better to do it in a clean way, and it appears the devs finally moved their asses last week
>other anon starts a rant about how comfy's codebase is garbage, KJ ISNT AT FAULT!!!
>no one ever claimed faults of anyone, mine were pure observations
I dont understand the fanboyism desu, like they're against X or in favour of Y for the most stupid quasi-political reasons. These guys (comfy/kijai) produce tools, I use their shit and that's it.
>>
>>106623273
and btw, kj's shit is broken right now
https://github.com/kijai/ComfyUI-KJNodes/pull/386
I assume this is for the model loader itself which I dont use
>>
Does it even matter if I use the sage argument or patcher node? Is there a quality difference?
>>
Anyone seen this glitch before?

nsfw https://files.catbox.moe/2xlbaq.mp4
>>
>>106623333
kys my dude
>>
>>106623333
Fuck off idiot
>>
>0.4 denoise
>perfect result
>try it again
>mangled, schizophrenic anatomy and extra arms
i do not comprehend
>>
File: AnimateDiff_00121.mp4 (1.73 MB, 720x720)
1.73 MB
1.73 MB MP4
>>106623346
>>106623380
>>
File: wan.jpg (56 KB, 723x608)
56 KB
56 KB JPG
New wan soon "apparently"

>>106622735
Havnt for a month or two, even deleted most of the models. Chroma does what I wanted sdxl/pony to do
>>
>>106621334
cool
>>
>>106623453
>wan 2.5
I wonder if they'll finally fix the inconsistent speed issues, and maybe even go up in native resolution
>>
File: c.png (1.27 MB, 832x1488)
1.27 MB
1.27 MB PNG
>>106622735
no, some do

but of course there is a lot of newer stuff to use or try out
>>
>>106622735
i do
>>
File: 00172-2022762841.png (1.52 MB, 768x1344)
1.52 MB
1.52 MB PNG
>>
File: 00003-1845440674.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>
how long does it take to do 480p/720p wan 2.2 gens on a 5090?
>>
>>106622735
Until we get a new good anime model, and that's a big if, I don't think I will ever use anything but sdxl for weebshit
>>106623453
I would rather get longer videos than better resolution and voices
>>
>>106623775
>Until we get a new good anime model
what more do you need?
>>
>>106623685
a few minutes.

with lightx2v maybe around two minutes or so
>>
Does anyone know why inpainting in swarmui always come out blank? Im using illustrious and whatever I do, it doesnt work.
>>
So does civit still accept nsfw loras or not? Where can I upload my shit?
>>
>>106623782
>NL so that it can do complex stuff and not limited by danbooru tags
I know you can cheat with CN, but it's annoying to do that every time you want something which isn't in tags
>Multi characters without bleeding and needing to inpaint every time
>Better VAE
>Better native resolution
>Some kind of character reference like NAI have, where you don't need a lora if you have 1 pic
>Better text
>>
>>106623782
Not shitting itself when you start demanding anything more complex.
>>
>>106623826
it does depending on what it is
>>
>>106620601
>like, no history book will record that as a result of gay people in video games and celebrities being sluts with their iphones, 4chan's userbase doubled and donald trump won his first term as a result

4chan is tiny against all other social media and is astronomically small against the size of the electorate
>>
>>106623839
>>106623860
post a single thing that you can't do already with what is currently available.
>>
Trying to make a personal Ami lora, really feels like I'm in the woods with the 5090, the real mind fuck is realizing the HD model for chroma is garbage and getting better results with normal chroma
>>
>>106623923
I've been trying to tell that to people lol
>>106613626
>>
>>106623931
>Base/2K + flash lora is a good speedy starting point
can you post a workflow? preferably also without flash lora
>>
>buy a 5090
>still stuck with 5 second gens
what the fuck did i just buy this for
>>
>>106623946
https://files.catbox.moe/tg9laa.png
And delete the Mahiro node because it sucks balls.
>>
>>106623685
With lightx2v loras, like 4min at 720p, like a fourth at 480.
rawdogging at 20steps, maybe like 8 times longer.
>>
>>106623931
I don't see the purpose of HD for anything, Why the fuck did the creator shill that piece of garbage when the outputs look objectively worse and the model is heavier. I fucking tried with that model but you can still train loras at 1024 with base and it will look better. He's just throwing shit at a wall hoping it sticks
>>
>>106624024
>https://files.catbox.moe/tg9laa.png
is that what you're prompting with chroma? isnt this easily doable with illustrious/noob?
>>
File: comfy0123456.jpg (2.79 MB, 2300x2373)
2.79 MB
2.79 MB JPG
>>106619949
anon... why...
>>
>>106624110
It's just my test prompt I have stashed in a text file.
>>
>>106624110
You don't need anything other than SDXL. Every new model is worthless or barely looks any better.
>>
>>106624110
To do most of chroma's composition requires regional prompt and in painting. It's easier to use chroma than to rig all that stuff together when the model can do it. I'm really disappointed with chroma but have to remind myself the shit tier state XL and other models were. At the very least this knows every godless sex act known to man so we can train loras and use them as tags
>>
>>106624152
which chroma version do you recommend? there's like 60 versions
>>
>>106624152
>To do most of chroma's composition requires regional prompt and in painting
Not it fucking doesn't promptlet retard
>>
>>106624198
Yes it fucking does genius prompt expert
>>
none of you are doing ANYTHING that requires so much autism. all of you are just genning 1girl slop or 5 second goon videos in i2v
>>
>>106624268
got a problem with that?
>>
>>106624268
im genning /a/ shitposts thougheverbeit
>>
>>106624268
pipe down nogen
>>
File: 1735330763756187.mp4 (1.73 MB, 640x640)
1.73 MB
1.73 MB MP4
the white hair anime girl with a black blindfold sits down at a round table and places her drink on the table.

wan 2.2, no high 2.2 lightning, 1.0 str low lightning lora, interpolated

no high = proper motion
>>
>>106624033
even if you're using q8 i think theres a problem with your wf since it takes me 5min to gen on a 3090 q8 light 4 step lora and film vfi upscale 1280x720
>>
>>106624268
nogen's got some spunk, do you know were in the sam hell you is boi?
>>
File: 1740424250430461.mp4 (1.76 MB, 640x640)
1.76 MB
1.76 MB MP4
>>106624379
the white hair anime girl with a black blindfold sits down at a round table, puts her drink on the table, and waves hello.
>>
>brave /adt/ gooner tries tranistudio
>it went how everyone told trani countless times
>>106624176
>>
>>106624535
wow it's nearly like it's complete shit made by a complete schizophrenic.

REALL WEIRD, REALLY MAKES YOU THINK INNIT
>>
>>106624535
kek
>inb4 singular schizo anon used haxx to forge this webm in bad faith + "ranfag"
>>
>>106624490
nice boat
>>
>>106624428
post your workflow
>>
>>106624535
negotiating commercial licenses of AniStudio(tm) for my 20 anime companies right now
>>
>>106624268
>1girl slop or 5 second goon videos in i2v
all you need tbqhwys
>>
>>106624619
https://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper
loop i2v workflow
change fp8 scaled with q8 and set quantization to disabled on those same loader nodes
change lcm sampelr to dpm sde
and i also disabled the florence model auto-create-prompt section and manually set the prompt
>>
>>106624428
I'm a beginner, but I don't think anything stands out, right? Been a while since I had my wf checked for retardism.

Also, why did the save output break? It worked yesterday.
%date:yyMMdd-hhmmss%/output
>>
>>106624535
>not using tiled vae
do retards really
>>
>>106624691
>>106624663
>>
>>106624701
how little ram do you have that you need to resort to tiled vae?
>>
>>106624691
It looks like you've got a WanVideoNAG node leading into another WanVideoNAG node? I don't think that's right
>>
>>106624691
>blur everything
>half of the pixels are shades of white
>half of the pixels are shades of brown
kek, its either pretty obvious what youre prompting or youre trolling
>>
>>106624379
what settings do you pick for high without lightning? sa_solver 20+ steps?
>>
>>106624731
considering how slow his inference was he is poverty level
>>
>>106624760
it's the default kijai 2.2 workflow using dpm++ sde, 6 steps (3/3)
>>
>>106624758
>>106623333
I'll never do blacked. I do orcs instead.

>>106624721
I started with his stuff, but I kept getting tons of color shifts and flashing stuff. I don't understand why this one is much better at that.

>>106624733
Oh shit, you're right. I just wanted to test it out and I couldn't tell if I noticed a difference, so I just left it.
>>
File: 1756244749458263.mp4 (897 KB, 576x480)
897 KB
897 KB MP4
>>
File: 1736902690437375.mp4 (587 KB, 576x480)
587 KB
587 KB MP4
>>106624784
better migu
>>
>>106624761
nah anistudio just really is super slow, at least 5 times slower than the same workflow on comfy
>>
>>106624780
>I do orcs instead.
adjacent, so you're trolling or deluded
>>
>he isn't using softmax
>he isn't using flash attention
>he didn't tile the vae
surprised you can do anything in comfy either desu
>>
File: file.png (904 KB, 639x822)
904 KB
904 KB PNG
w-what do three lines mean?
>>
>>106624830
source?
>>
>>106624535
Surprised it actually starts
It just crashes for me on windows and ubuntu while comfy and even auto1111 work on the same boxes
>>
File: comfy519.jpg (2.28 MB, 2048x2048)
2.28 MB
2.28 MB JPG
what if we kissed under ComfyUI nodes?
>>
>>106624879
you'd become an API node
>>
File: 1734866248338782.mp4 (841 KB, 672x480)
841 KB
841 KB MP4
>>
>>106624879
>two nuclear-grade kriptonite weapons of mass destruction for White men
>>
>>106624869
so why have you never left an issue with the terminal/cmd output or ever prove that this is actually happening? why would ani give a shit about your shit box if you never have receipts?
>>
>>106624899
>korean
nah
>>
>>106624663 (me)
just tested some wan samplers, dpm sde is ok but unipc seems the best against body horror
>>
>>106624939
why are you talking about yourself in the third person

why do you delete all issues made on git
it's nearly like you're insane or someting.
haha couldn't be though, right?
you're a normal functioning member of society.. right?
>>
Anistudio in the next OP yes?
>>
File: 1743409211626994.mp4 (1.31 MB, 672x480)
1.31 MB
1.31 MB MP4
>>106624895
>>
>>106624982
>why do you delete all issues made on git
lmfao what the fuck are you even talking about. post receipts
>>
anyone tried empty prompt on WAN?
>>
>>106624009
>what the fuck did i just buy this for
you just bought jensen another genuine leather jacket of course, cuck.
>>
>>106624535
Thanks for linking this
Guess i will just stay comfy then
>>
>>106625103
it's honestly hilarious you can't provide reciepts
>>
>>106625117
?
>>
File: 1755428120876283.mp4 (1.16 MB, 672x480)
1.16 MB
1.16 MB MP4
BUY THE GPUS

OR ELSE!
>>
>>106625120
you are the retarded fud anon that complains it doesn't run but can't provide the debug output
>>
>>106624997
>post receipts
now try in english. thanks.
>>
anistudio is really fast and I didn't have to think at all to install it
>>
SaaS won
>>
>>106625151
>>106625151
>>106625151
>>106625151
>>106625151
>>
>>106623775
Yea, hopefully they can work their magic so it takes less vram and introduce some new tech that allows for proper extension.

>>106624009
kek, just use the new native context nodes, useful for repetitive movements, you have 32gb so should easily go beyond to 25 second gens. my 4070tis can only get to 13 seconds.
>>
File: f.mp4 (1.31 MB, 640x360)
1.31 MB
1.31 MB MP4
>>106624774
ty, maybe i can make more stuff work



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.